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Phytases (myo- inositol hexakisphosphate phosphohydrolases; EC 3.1.3.8) are 
5 enzymes that hydrolyze phytate (myo-inositol hexakisphosphate) to wyo-inositol and 
inorganic phosphate and are known to be valuable feed additives. 

A phytase was first described in rice bran in 1907 [Suzuki et al., Bull. Coll.. Agr. 
Tokio Imp. Univ. 2, 495 (1907)] and phytases from Aspergillus species in 191 1 [Dox and 
Golden, J. Biol. Chem. 1Q, 183-186 (1911)]. Phytases have also been found in wheat bran, 
10 plant seeds, animal intestines and in microorganisms [Howsen and Davis, Enzyme Microb. 
Technol. 5, 377-382 (1983), Lambrechts et al., Biotech. Lett. 14, 61-66 (1992), Shieh and 
Ware, Appl. Microbiol. 16, 1348-1351 (1968)]. 

The cloning and expression of the phytase from Aspergillus niger (ficuum) has been 
described by Van Hartingsveldt et al, in Gene, 122, 87-94 (1993) and in European Patent 
15 Application, Publication No. (EP) 420 358 and from Aspergillus niger var. awamori by 
Piddington et al., in Gene 122, 55-62 (1993). 

Cloning, expression and purification of phytases with improved properties have been 
disclosed in EP 684 313. However, since there is a still ongoing need for further improved 
phytases, especially with respect to their thermostability, it is an object of the present 
20 invention to provide the following process which is, however, not only applicable to 
phytases. 

A process for the preparation of a consensus protein, whereby such process is 
characterized by the following steps: 

a) at least three, preferably four amino acid sequences of a defined protein family 
25 are aligned by any standard alignment program known in the art; 

b) amino acids at the same position according to such alignment are compared 
regarding their evolutionary similarity by any standard program known in the 
art, whereas the degree of similarity provided by such a program which defines 
the least similarity of the amino acids that is used for the determination of an 

30 amino acid of corresponding positions is set to a less stringent number and the 

parameters are set in such a way that it is possible for the program to determine 
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from only 2 identical amino acids at a corresponding position an amino acid for 
the consensus protein; however, if among the compared amino acid sequences 
are sequences that show a much higher degree of similarity to each other than 
to the residual sequences, these sequences are represented by their consensus 
sequence determined as defined in the same way as in the present process for 
the consensus sequence of the consensus protein or a vote weight of 1 divided 
by the number of such sequences is assigned to every of those sequences; 

c) in case no common amino acid at a defined position can be identified by the 
program, any of the amino acids of all sequences used for the comparison, 
preferably the most frequent amino acid of all such sequences is selected or an 
amino acid is selected on the basis of the consideration given in Example 2; 

d) once the consensus sequence has been defined, such sequence is back- 
translated into a DNA sequence, preferably using a codon frequency table of 
the organism in which expression should take place; 

e) the DNA sequence is synthesized by methods known in the art and used either 
integrated into a suitable expression vector or by itself to transform an 
appropriate host cell; 

f) the transformed host cell is grown under suitable culture conditions and the 
consensus protein is isolated from the host cell or its culture medium by 
methods known in the art. 

In a preferred embodiment of this process step b) can also be defined as follows: 
b) amino acids at the same position according to such an alignment are compared regarding 
their evolutionary similarity by any standard program known in the art, whereas the degree 
of similarity provided by such program is set at the lowest possible value and the amino 
acid which is the most similar for at least half of the sequences used for the comparison is 
selected for the corresponding position in the amino acid sequence of the consensus 
protein. 



In another preferred embodiment the consensus sequence is used in order to improve 
a specific protein. In this process first a consensus sequence is determined from a number 
of highly homologous sequences according to steps a), b) and c) as described above. In a 
second step the amino acid sequence of another protein which is homologous to the 
consensus sequence is compared with the consensus sequence and in a third step only 
those amino acid residues are replaced in the amino acid sequence of the other protein 
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which clearly differ from the consensus sequence of this protein family calculated under 
moderately stringent conditions whereas at all positions of the alignment where no 
preferred single amino acid can be determined under moderately stringent conditions the 
amino acids of the other protein remain unchanged. 

5 By using this preferred embodiment the consensus sequence derived from a number 

of highly homologous sequences is used in order to replace only certain amino acid 
residues in the protein in such a manner that only those amino acid residues are replaced 
which clearly and unambiguously differ from the corresponding consensus sequence of this 
protein family which has been calculated on moderately stringent conditions. At all other 
10 positions of the alignment, however, where the method of the present invention is not able 
to determine clearly a preferred amino acid residue under moderately stringent conditions 
the amino acid residues of the other protein are maintained unchanged. 

A further preferred embodiment is a process wherein at first a consensus sequence is 
determined from homologous sequences as described above. In a second step the active 

15 center of the protein comprising all amino acid residues that are involved in forming the 
active center is determined in the consensus sequence and in the sequence of a single 
homologous protein as well. The single homologous protein may have preferred properties 
like high specific activity or different pH dependency of enzymatic activity. In a third step 
some or all amino acid residues that are involved in forming the active centre of the 

20 homologeous protein are inserted into the backbone of the consensus sequence. The result 
thereof is a chimeric protein having the active centre derived from a single protein and the 
backbone of the consensus sequence. 

The active centre of the protein can be determined e.g. by using any analysis of the 
three-dimensional structure of the protein, e.g. by homology modelling on the basis of a 
25 known 3D-structure of a known protein. Frequently the single homologeous protein is an 
enzyme. 

It is furthermore an object of the present invention to provide such a process, 
wherein the program used for the comparison of amino acids at a defined position 
regarding their evolutionary similarity is the program "PRETTY". It is more specifically an 
30 object of the present invention to provide such a process, wherein the defined protein 
family is the family of phytases, especially wherein the phytases are of fungal origin. 

It is furthermore an object of the present invention to provide such processes, 
wherein the host cell is of eukaryotic, especially fungal, preferably Aspergillus or yeast, 
preferably Saccharomyces or Hansenula origin. 
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It is also an object of the present invention to provide a consensus protein obtainable 
preferably obtained, by such processes and specifically the consensus protein, which has 
the amino acid sequences shown in Figures 2, 4 and 6 or a variant thereof. A "variant" 
refers in the context of the present invention to a consensus protein with amino acid 

5 sequence shown in Figure 2, 5, 7, and 8 wherein at one or more positions amino acids have 
been deleted, added or replaced by one or more other amino acids with the proviso that the 
resulting sequence provides for a protein whose basic properties like enzymatic activity 
(type of and specific activity), thermostability, activity in a certain pH-range (pH-stability) 
have not significantly been changed. "Significantly" means in this context that a man 

10 skilled in the art would say that the properties of the variant may still be different but 
would not be unobvious over the ones of the consensus protein with the amino acid 
sequence of Figure 2 itself. 

A "mutein" refers in the context of the present invention to replacements of the 
15 amino acid in the amino acid sequences of the consensus proteins shown in Figure 2 which 
lead to consensus proteins with further improved properties e.g. activity. Such muteins can 
be defined and prepared on the basis of the teachings given in European Patent Application 
number 97810175.6, e. g. Q50L, Q50T, Q50G, Q50L-Y51N, or Q50T-Y51N. "Q50L" 
means in this context that at position 50 of the amino acid sequence (Figure 2) the amino 
20 acid Q has been replaced by amino acid L. 

In addition, a food, feed or pharmaceutical composition comprising a consensus 
protein as defined above is also an object of the present invention. 

In this context "at least three preferably four amino acid sequences of such defined 
protein family" means that three, four, five, six to 12, 20, 50 or even more sequences can 

25 be used for the alignment and the comparison to create the amino acid sequence of the 
consensus protein. "Sequences of a defined protein family" means that such sequences fold 
into a three dimensional structure, wherein the alpha-helixes, the beta-sheets and beta-tums 
are at the same position so that such structures are, as called by the man skilled in the art, 
largely superimposable. Furthermore these sequences characterize proteins which show the 

30 same type of biological activity, e.g. a defined enzyme class, e.g. the phytases. As known 
in the art, the three dimensional structure of one of such sequences is sufficient to allow 
the modelling of the structure of the other sequences of such a family. An example, how 
this can be effected, is given in the Reference Example of the present case. "Evolutionary 
similarity" in the context of the present invention refers to a scheme which classifies amino 

35 acids regarding their structural similarity which allows that one amino acid can be replaced 
by another amino acid with a minimal influence on the overall structure, as this is done e.g. 
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by programs, like "PRETTY", known in the art. The phrase "the degree of similarity 
provided by such a program...is set to less stringent number" means in the context of the 
present invention that values for the parameters which determine the degree of similarity in 
the program used in the practice of the present invention are chosen in a way to allow the 

5 program to define a common amino acid for a maximum of positions of the whole ammo 
acid sequence, e. g. in case of the program PRETTY a value of 2 or 3 for the 
THRESHOLD and a value of 2 for the PLURALITY can be choosen. Furthermore, "a vote 
weight of one divided by the number of such sequences" means in the context of the 
present invention that the sequences which define a group of sequences with a higher 

10 degree of similarity as the other sequences used for the determination of the consensus 
sequence only contribute to such determination with a factor which is equal to one divided 
by a number of all sequences of this group. 

As mentioned before should the program not allow to select the most similar amino 
acid the most frequent amino acid is selected, should the latter be impossible the man 
15 skilled in the art will select an amino acid from all the sequences used for the companson 
which is known in the art for its property to improve the thermostability in proteins as 
discussed e.g. by 

Janecek, S. (1993), Process Biochem. 28, 435-445 or 
Fersht, A. R. & Serrano, L. (1993), Curr. Opin. Struct. Biol. 3, 75-83. 
20 Alber, T. (1989), Annu. Rev. Biochem. 58, 765-798 or 
Matthews, B. W. (1987), Biochemistry 26, 6885-6888. 
Matthews, B. W. (1991), Curr. Opin. Struct. Biol. 1, 17-21. 



25 



30 



The stability of an enzyme is a critical factor for many industrial applications. 
Therefore, a lot of attempts, more or less successful, have been made to improve the 
stability preferably the thermostability of enzymes by rational (van den Burgefa/., 1998) 
or irrational approaches (Akanuma et al, 1998). The forces influencing the thermostability 
of a protein are the same as those that are responsible for the proper folding of a peptide 
strand (hydrophobic interactions, van der Waals interactions, H-bonds, salt bridges, 
conformational strain (Matthews, 1993). Furthermore, as shown by Matthews et ah (1987), 
the free energy of the unfolded state has also an influence on the stability of a protein. 
Enhancing of protein stability means to increase the number and strength of favorable 
interactions and to decrease the number and strength of unfavorable interactions. It has 
been possible to introduce disulfide linkages (Sauer et al, 1 986) to replace glycine with 
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alanine residues or to increase the proline content in order to reduce the free energy of the 
unfolded state (Margarit et al, 1992; Matthews, 1987a). Other groups concentrated on the 
importance of additional H-bonds or salt bridges for the stability of a protein (Blaber et al, 
1993) or tried to fill cavities in the protein interior to increase the buried hydrophobic 
surface area and the van der Waals interactions (Karpusas et al, 19898). Furthermore, the 
stabilization of secondary structure elements, especially a-helices, for example, by 
improved helix capping, was also investigated (Munoz & Serrano, 1995). 

However, there is no fast and promising strategy to identify amino acid replacements 
which will increase the stability, preferably the thermal stability of a protein. Commonly, 
the 3D structure of a protein is required to find locations in the molecule where an amino 
acid replacement possibly will stabilize the protein's folded state. Alternative ways to 
circumvent this problem are either to search for a homologous protein m a thermo- or 
hyperthermophile organism or to detect stability-increasing amino acid replacements by a 
random mutagenesis approach. This latter possibility succeeds in only 10 3 to 10 mutations 
and is restricted to enzymes for which a fast screening procedure is available (Arase et al, 
1993- Risse et al, 1992). For all these approaches, success was variable and unpredictable 
and, if successful, the thermostability enhancements nearly always were rather small. 

Here we present an alternative way to improve the thermostability of a protein. 
Imanaka et al (1986) were among the first to use the comparisons of homologous proteins 
to enhance the stability of a protein. They used a comparison of proteases from 
thermophilic with homologous ones of mesophilic organisms to enhance the stability of a 
mesophilic protease. Serrano et al (1993) used the comparison of the amino acid sequences 
of two homologous mesophilic RNases to construct a more thermostable Rnase. They 
mutated individually all of the residues that differ between the two and combined the 
mutations that increase the stability in a multiple mutant. Pantoliano et al (1989) and, in 
particular, Steipe et al (1994) suggested that the most frequent amino acid at every position 
of an alignment of homologous proteins contribute to the largest amount to the stability of 
a protein Steipe et al (1994) proved this for a variable domain of an immunoglobulin, 
whereas Pantoliano et al (1989) looked for positions in the primary sequence of subUhsin 
in which the sequence of the enzyme chosen to be improved for higher stability was 
singularly divergent. Their approach resulted in the replacement M50F which increased the 
T m of subtilisin by 1 . 8 °C. 

Steipe et al (1994) proved on a variable domain of immunoglobulin that it is 
possible to predict a stabilizing mutation with better than 60% success rate just by using a 
statistical method which determines the most frequent amino acid residue at a certain 
position of this domain. It was also suggested that this method would provide useful results 
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not only for stabilization of variable domains of antibodies but also for domains of other 
proteins However, it was never mentioned that this method could be extended to the entire 
protein. Furthermore, nothing is said about the program which was used to calculate the 
frequency of amino acid residues at a distinct position or whether scoring matrices were 
used as in the present case. 

It is therefore an object of the present invention to provide a process for the 
preparation of a consensus protein comprising a process to calculate an amino acid residue 
for nearly all positions of a so-called consensus protein and to synthesize a complete gene 
from this sequence that could be expressed in a pro- or eukaryotic expression system. 

DNA sequences of the present invention can be constructed starting from genomic or 
cDNA sequences coding for proteins, e.g. phytases known in the art [for sequence 
information see references mentioned above, e.g. 

EP 684 313 or sequence data bases, for example like Genbank (Intelligenetics, California, 
USA) European Bioinformatics Institute (Hinston Hall, Cambridge, GB), NBRF 
(Georgetown University, Medical Centre, Washington DC, USA) and Vecbase (University 
of Wisconsin, Biotechnology Centre, Madison, Wisconsin, USA) or disclosed « the 
figures by methods of in vitro mutagenesis [seee.g. Sambrook et al., Molecular Cloning, 
Cold Spring Harbor Laboratory Press, New York]. A widely used strategy for "site directed 
mutagenesis", as originally outlined by Hurchinson and Edgell [J. Virol. S, 181 (1971)], 
involves the annealing of a synthetic oligonucleotide carrying the desired nucleotide 
substitution to a target region of a single-stranded DNA sequence wherein the mutation 
should be introduced [for review see Smith, Annu. Rev. Genet. 12, 423 (1985) and for 
improved methods see references 2-6 in Stanssen et al., Nucl. Acid Res., II. 4441-4454 
(1989)] Another possibility of mutating a given DNA sequence which is also preferred for 
the practice of the present invention is the mutagenesis by using the polymerase chain 
reaction (PCR). DNA as starting material can be isolated by methods known in the art and 
described e.g. in Sambrook et al. (Molecular Cloning) from the respective strains. For 
strain information see, e.g. EP 684 313 or any depository authority indicated below. 
Aspergillus niger [ATCC 9142], Myceliophthora thermophila [ATCC 48102], 
) Talaromyces thermophilus [ATCC 20186] and Aspergillus fumigatus [ATCC 34625] have 
been redeposited according to the conditions of the Budapest Treaty at the Amencan Type 
Culture Cell Collection under the following accession numbers: ATCC 74337, ATCC 
74340 ATCC 74338 and ATCC 74339, respectively. It is however, understood that DNA 
encoding a consensus protein in accordance with the present invention can also be 
5 prepared in a synthetic manner as described, e.g. in EP 747 483 or the examples by 
methods known in the art. 
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The process of the present invention can preferably be used in order to improve the 
thermostability of the enzyme phytase. After having constructed different consensus 
phytase sequences it was possible to decide whether single amino acid replacements had a 
positive or a negative effect on the protein stability. It is therefore another subject of the 
5 present invention to improve the thermostability of a phytase. 

In this embodiment single amino acids are changed in the sequence of the phytase by 
the introduction of at least one mutation selected from the group consisting of 



E58A 


F54Y 


D69K 


I73V 


D197N 


K94A 


T214L 


R101A 


E222T 


N153K 


E267D 


VI 581 


R291I 


A203G 


R329H 


S205G 


S364T 


V217A 


A379K 


A227V 


G404A 


V234L 




P238A 




Q277E 




A287H 




A292Q 




V366I 




A396S 




E415Q 




G437A 




E451R 
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In the above-given mutations the number represents the position in the consensus 
phytase-l-sequence as given in Figure 2 and the letter before the number represents the 
amino acid in the phytase which is replaced by the respective amino acid behind the 
number. The numbers given correspond to the consensus phytase sequence or relate to a 
5 corresponding residue calculated by an alignment as shown in Figure 1 when 26 amino 
acids (signal sequence) are added to the sequences shown in Fig. 1 . Those mutations can 
be introduced into consensus sequences or into sequences of specific enzymes which have 
been improved by a process of the present invention. The above-mentioned amino acid 
replacements have a positive effect on the protein stability. 

10 Once complete DNA sequences of the present invention have been obtained they can 

be integrated into vectors by methods known in the art and described e.g. in Sambrook et 
al. (s.a.) to overexpress the encoded polypeptide in appropriate host systems. However, a 
man skilled in the art knows that also the DNA sequences themselves can be used to 
transform the suitable host systems of the invention to get overexpression of the encoded 
15 polypeptide. Appropriate host systems are for example fungi, like Aspergilli, e.g. 
Aspergillus niger [ATCC 9142] or Aspergillus ficuum [NRRL 3135] or like Trichoderma, 
e.g. Trichoderma reesei or yeasts, like Saccharomyces, e.g. Saccharomyces cerevisiae or 
Pichia like Pichia pastoris, or Hansenula polymorpha, e.g. H. polymorpha (DSM5215) or 
plants' as described, e.g. by Pen et al., Bio/Technology 11,81 1-814 (1994). A man skilled 
20 in the art knows that such microorganisms are available from depository authorities, e.g. 
the American Type Culture Collection (ATCC), the Centraalbureau voor 
Schimmelcultures (CBS) or the Deutsche Sammlung fur Mikroorganismen und 
Zellkulturen GmbH (DSM) or any other depository authority as listed in the Journal 
"Industrial Property" [(1991) 1, pages 29-40]. Bacteria which can be used are e.g. E. coli, 
25 Bacilli as, e.g. Bacillus subtilis or Streptomyces, e.g. Streptomyces lividans (see e.g. Anne 
and Mallaert in FEMS Microbiol. Letters 1H, 121 (1993). E. coli, which could be used are 
E. coli K12 strains e.g. Ml 5 [described as DZ 291 by Villarejo et al. in J. Bacteriol. 12Q, 
466-474 (1974)], HB 101 [ATCC No. 33694] or E. coli SG13009 [Gottesman et al., J. 
Bacteriol. 14S, 265-273 (1981)]. 

30 Vectors which can be used for expression in fungi are known in the art and described 

e.g. in EP 420 358, or by Cullen et al. [Bio/Technology 5, 369-376 (1987)] or Ward in 
Molecular Industrial Mycology, Systems and Applications for Filamentous Fungi, Marcel 
Dekker, New York (1991), Upshall et al. [Bio/Technology 5_, 1301-1304 (1987)] Gwynne 
et al. [Bio/Technology 5, 71-79 (1987)], Punt et al. [J. Biotechnol. II, 19-34 (1991)] and 

35 for yeast by Sreekrishna et al. [J. Basic Microbiol. 23, 265-278 (1988), Biochemistry 2S, 
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4117-4125 (1989)], Hitzemann et al. [Nature 222, 717-722 (1981)] or in 
EP 183 070 EP 183 071, EP 248 227, EP 263 311. Suitable vectors which can be used for 
expression In E. coli are mentioned, e.g. by Sambrook et al. [s.a.] or by Fiers et al. in 
Procd 8th Int. Biotechnology Symposium" [Soc. Franc, de Microbiol., Paris (Durand et al., 
eds) pp 680-697 (1988)] or by Bujard et al. in Methods in Enzymology, eds. Wu and 
Grossmann, Academic Press, Inc. Vol. 155. 416-433 (1987) and Stuber et al. in 
Immunological Methods, eds. Lefkovits and Pernis, Academic Press, Inc., Vol. IV, 121- 
152 (1990) Vectors which could be used for expression in Bacilli are known in the art and 
described e g. in EP 405 370, Procd. Natl. Acad. Sci. USA SI, 439 (1984) by Yansura and 
Henner Meth. Enzymol. 1S5, 199-228 (1990) or EP 207 459. Vectors which can be used 
for the expression in H. Polymorpha are known in the art and described, e.g. m Gelhssen et 
al., Biotechnology 2, 291-295 (1991). 

Either such vectors already carry regulatory elements, e.g. promoters, or the DNA 
sequences of the present invention can be engineered to contain such elements. Suitable 
promotor elements which can be used are known in the art and are, e.g. for Trichoderma 
reesei the cbhl- [Haarki et al., Biotechnology 2, 596-600 (1989)] or the pkil-promotor 
[Schindler et al., Gene m 271-275 (1993)], for Aspergillus oryzae the amy-promotor 
[Christensen et al., Abstr. 19th Lunteren Lectures on Molecular Genetics F23 (1987), 
Christensen et al., Biotechnology S, 1419-1422 (1988), Tada et al., Mol. Gen. Genet. 222, 
301 (1991)], for Aspergillus niger the glaA- [Cullen et al., Bio/Technology 5_, 369-376 
(1987) Gwynne et al., Bio/Technology 5, 713-719 (1987), Ward in Molecular Industrial 
Mycology Systems and Applications for Filamentous Fungi, Marcel Dekker, New York, 
83-106 (1991)], alcA- [Gwynne et al., Bio/Technology 5, 718-719 (1987)], sucl- [Boddy et 
al Curr Genet. 24, 60-66 (1993)], aphA- [MacRae et al., Gene 21, 339-348 (1988), 
MacRae et al., Gene 122, 193-198 (1993)], tpiA- [McKnight et al., Cell 46, 143-147 
(1986), Upshall et al., Bio/Technology 5, 1301-1304 (1987)], gpdA- [Punt et al., Gene 6J>, 
49-57 (1988), Punt et al., J. Biotechnol. 12, 19-37 (1991)] and the pkiA-promotor [de 
Graaff et al., Curr. Genet. 22, 21-27 (1992)]. Suitable promotor elements which could be 
used for expression in yeast are known in the art and are, e.g. the pho5-promotor [Vogel et 
30 al., Mol. Cell. Biol., 2050-2057 (1989); Rudolf and Hinnen, Proc. Natl. Acad. Sci. 84, 
1340 1344 (1987)] or the gap-promotor for expression in Saccharomyces cerevisiae and for 
Pichia pastoris, e.g. the aoxl-promotor [Koutz et al., Yeast 5_, 167-177 (1989); Sreekrishna 
et al., J. Basic Microbiol. 2S, 265-278 (1988)], or the FMD promoter [Hollenberg et al., 
EPA No. 0299108] or MOX-promotor [Ledeboer et al., Nucleic Acids Res. 12, 3063-3082 
35 (1985)] for H. polymorpha. 



25 
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Accordingly vectors comprising DNA sequences of the present invention, preferably 
for the expression of said DNA sequences in bacteria or a fungal or a yeast host and such 
transformed bacteria or fungal or yeast hosts are also an object of the present invention. 

It is also an object of the present invention to provide a system which allows for high 
expression of proteins, preferably phytases like the consensus phytase of the present 
invention in Hansenula characterized therein that the codons of the encoding DNA 
sequence of such a protein have been selected on the basis of a codon frequency table of 
the organism used for expression, e.g. yeast as in the present case (see e.g. m Example 3) 
and optionally the codons for the signal sequence have been selected in a manner as 
described for the specific case in Example 3. That means that a codon frequency table is 
prepared on the basis of the codons used in the DNA sequences which encode the ammo 
acid sequences of the defined protein family. Then the codons for the design of the DNA 
sequence of the signal sequence are selected from a codon frequency table of the host cell 
used for expression whereby always codons of comparable frequency in both tables are 
used. 

Once such DNA sequences have been expressed in an appropriate host cell in a 
suitable medium the encoded protein can be isolated either from the medium in the case 
the protein is secreted into the medium or from the host organism in case such protein is 
present intracellular^ by methods known in the art of protein purification or described in 
case of a phytase, e.g. in EP 420 358. Accordingly a process for the preparation of a 
polypeptide of the present invention characterized in that transformed bacteria or a host 
cell as described above is cultured under suitable culture conditions and the polypeptide is 
recovered therefrom and a polypeptide when produced by such a process or a polypeptide 
encoded by a DNA sequence of the present invention are also an object of the present 
invention. 

Once obtained the polypeptides of the present invention can be characterized 
regarding their properties which make them useful in agriculture any assay known in the 
art and described e.g. by Simons et al. [Br. J. Nutr. M, 525-540 (1990)], SchSner et al. [J. 
Anim Physiol, a. Anim. Nutr. fifi, 248-255 (1991)], Vogt [Arch. GeflOgelk. Sfi, 93-98 
(1992)], Jongbloed et al. [J. Anim. Sci., 7J2, 1 159-1168 (1992)], Perney et al. [Poultry Sc.. 
22 2106-21 14 (1993)], Farrell et al., [J. Anim. Physiol, a. Anim. Nutr. £2, 278-283 (1993), 
Broz et al., [Br. Poultry Sci. 25, 273-280 (1994)] and Dungelhoef et al. [Animal Feed Sci. 
Technol. 42, 1-10 (1994)] can be used. 

In general the polypeptides of the present invention can be used without being 
limited to a specific field of application, e.g. in case of phytases for the conversion of 
inositol polyphosphates, likephytate to inositol and inorganic phosphate. 
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Furthermore the polypeptides of the present invention can be used in a process for 
the preparation of a pharmaceutical composition or compound food or feeds wherein the 
components of such a composition are mixed with one or more polypeptides of the present 
invention. Accordingly compound food or feeds or pharmaceutical composite 
comprising one or more polypeptides of the present invention are also an object of the 
present invention. A man skilled in the art is familiar with their process of preparation. 
Such pharmaceutical compositions or compound foods or feeds can further compnse 
additives or components generally used for such purpose and known in the state of the art. 

It is furthermore an object of the present invention to provide a process for the 
reduction of levels of phytate in animal manure characterized in that an animal is fed such 
a feed composition in an amount effective in converting phytate contained in the feedstuff 
to inositol and inorganic phosphate. 

Before describing the present invention in more detail a short explanation of the 
Figures enclosed is given below. 

Figure 1: Design of the consensus phytase sequence. The letters represent the amino acid 
residues in the one-letter code. The following sequences were used for the alignment: phyA 
from Aspergillus terreus 9A-1 (Mitchell et al, 1997; from amino acid (aa) 21), phyA from 
A terreus cbsl 16.46; (van Loon et al., 1998; from aa 27), phyA from Aspergillus niger var. 
aW amori (Piddington et al, 1993; from aa 27), phyA from A. niger T213; from aa 27), phyA 
from A niger strain NRRL3135 (van Hartingsveldt et al, 1993; from aa 27),^ from 
Aspergillus fumigatus ATCC 13073 (Pasamontes et al, 1993; from aa 25), phyA from A. 
fumigatus ATCC 32722 (van Loon et al, 1998; from aa 27), phyA from A. fumigatus 
ATCC 58128 (van Loon et al, 1998; from aa 27), phyA from A. fumigatus ATCC 26906 
(van Loon et al, 1998; from aa 21),phyA from A. fumigatus ATCC 32239 (van Loon et al, 
1998- from aa 30), phyA from Emericella nidulans (Pasamontes et al, 1997a; from aa 25), 
phyA from Talaromyces thermophilus (Pasamontes et al, 1997a; from aa 24), and/^A 
from Myceliophthora thermophila (Mitchell et al, 1997; from aa 19). The alignment was 
calculated using the program PILEUP. The location of the gaps was refined by hand. 
Capitalized amino acid residues in the alignment at a given position belong to the amino 
acid coalition that establish the consensus residue. In bold, beneath the calculated 
consensus sequence, the amino acid sequence of the finally constructed consensus phytase 
(Fcp) is shown. The gaps in the calculated consensus sequence were filled by hand 
according to principals stated in Example 1 . 
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Eigma^ DNA sequence of the consensus phytase-1 gene (fcp) and of the primers used for 
the gene construction. The calculated amino acid sequence (Figure 1) was converted into a 
DNA sequence using the program BACKTRANSLATE (Devereux et al., 1984) and the 
codon frequency table of highly expressed yeast genes (GCG program package, 9.0). The 
signal peptide of the phytase from A. terreus cbs. 1 1 6.46 was fused to the AMerminus. The 
bold bases represent the sequences of the oligonucleotides used to generate the gene. The 
names of the respective oligonucleotides are alternately noted above or below the 
sequence. The underlined bases represent the start and stop codon of the gene. The bases 
written in italics show the two introduced Eco RI sites. 



10 Figure^: Alignment and consensus sequence of five Basidiomycetes phytases. The letters 
represent the amino acid residues in the one-letter code. The amino acid sequences of the 
phytases from Paxillus involutus, phyAl (aa 21) and phyA2 (aa 21, WO 98/28409), 
Trametespubescens (aa 24, WO 98/28409), Agrocybe pediades (aa 19, WO 98/28409), 
and Peniophora lycii (aa 21 , WO 98/28409) starting with the amino acid residues 

15 mentioned in parentheses, were used for the alignment and the calculation of the 

corresponding consensus sequence called "Basidio" (Example 2). The alignment was 
performed by the program PILEPUP. The location of the gaps was refined by hand. The 
consensus sequence was calculated by the program PRETTY. While a vote weight of 0.5 
was assigned to the two P. involutus phytases, all other genes were used with a vote weight 

20 of 1 0 for the consensus sequence calculation. At positions, where the program was not 
able to determine a consensus residues, the Basidio sequence contains a dash. Capitalized 
amino acid residues in the alignment at a given position belong to the amino acid coalition 
that establish the consensus residue. 

Figure 4: Design of consensus phytase-10 amino acid sequence. Adding the phytase 
25 sequence of Thermomyces lanuginosa (Berka et al., 1998) and the consensus sequence of 
the phytases from five Basidiomycetes to the alignment of Figure 1, an improved 
consensus sequence was calculated by the program PRETTY. Additionally, the amino acid 
sequence of A. niger T213 was omitted, therefore, using a vote weight of 0.5 for the 
remaining A. niger phytase sequences. For further information see Example 2. 



30 



Fjguie^iDNA and amino acid sequence of consensus phytase-10. The amino acid 
sequence is written above the corresponding DNA sequence using the one-letter code. The 
sequence of the oligonucleotides which were used to assemble the gene are in bold letters. 
The label of oligonucleotides and the amino acids, which were changed compared to those 
for consensus phytase -1, are underlined and their corresponding triplets are highlighted m 
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small cases. The/cplO gene was assembled from the following oligonucleotides: CP-1, 
CP-2 CP-3 10, CP-4.10, CP-5.10, CP-6, CP-7.10, CP-8.10, CP-9.10, CP-10.10, CP-1 1.10, 
CP-12.10, CP-13.10, CP-14.10, CP-15.10, CP-16.10, CP-17.10, CP18.10, CP-19.10, CP- 
20.10, CP-21.10, CP-22.10. The newly synthesized oligonucleotides are additionally 

5 marked by number 1 0. The phytase contains the following 32 exchanges: Y54F, E58A, 
D69K, D70G, A94K.N134Q, I158V, S187A, Q188N, D197N, S204A, T214L.D220E, 
L234V A238P, D246H, T251N, Y259N, E267D, E277Q, A283D, R291I, A320V, 
R329H, S364T, I366V, A379K, S396A, G404A, Q415E, A437G, A463E. The mutations 
accentuated in bold letters revealed a stabilizing effect on consensus phytase-1 as tested as 

10 single mutation in consensus phytase- 1 . 

Figure 6: Alignment for the design of consensus phytase-1 1. In contrast to the design of 
consensus phytase-10, for the design of the amino acid sequence of consensus phytase-1 1, 
all Basidiomycetes phytases were used as independent sequences using an assigned vote 
weight of 0.2 for each Basidiomycetes sequence. Additionally, the amino acid sequence of 
15 A. niger T213 was used in that alignment, again. 

Figure 7 ; DNA and amino acid sequence of consensus phytase-1 -thermo[8]-Q50T-K91 A. 
The amino acid sequence is written above the corresponding DNA sequence using the one- 
letter code. The replaced amino acid residues are underlined. The stop codon of the gene is 
marked by a star (*). 

20 Figure 8: DNA and amino acid sequence of consensus phytase- 10-thermo[3]-Q50T-K91 A. 
The amino acid sequence is written above the corresponding DNA sequence using the one- 
letter code. The replaced amino acid residues are underlined. The stop codon of the gene is 
marked by a star (*). 

Figured DNA and amino acid sequence of A. fumigatus ATCC 13073 phytase a-mutant. 
25 The amino acid sequence is written above the corresponding DNA sequence using the one- 
letter code. The replaced amino acid residues are underlined. The stop codon of the gene is 
marked by a star (*). 

EigmaJGiDNA and amino acid sequence of consensus phytase-7. The amino acids are 
written above the corresponding DNA sequence using the one-letter code. The sequence of 
30 the oligonucleotides used to assemble the gene are in bold letters. Oligonucleotides and 
amino acids that were exchanged are underlined and their corresponding triplets are 
highlighted in small cases. The/cp7 gene was assembled from the following 
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oligonucleotides: CP-U CP-2, CP-3, CP-4.7, CP-5.7, CP-6, CP-7, CP-8.7, CP-9, CP-10.7, 
CP-1 1.7, CP-12.7, CP-13.7, CP-14.7, CP-15.7, CP-16, CP-17.7, CP-18.7, CP-19.7, CP-20, 
CP-21, CP-22. The newly synthesized oligonucleotides are additionally marked by number 
7 The'phytase contains the following 24 exchanges in comparison to the original 
consensus phytase: S89D, S92G, A94K, D164S, P201S, G203A, G205S, H212P, G224A, 
D226T, E255T, D256E, V258T, P265S, Q292H, G300K, Y305H, A314T, S364G, M365I, 
A397S, S398A, G404A, and A405S. 

EigmeOi- Differential scanning calorimetry (DSC) of consensus phytase-1 and consensus 
phytase-10. The protein samples were concentrated to ca. 50-60 mg/ml and extensively 
dialyzed against 10 mM sodium acetate, pH 5.0. A constant heating rate of 10 °C/m.n was 
applied up to 95 °C. DSC of consensus phytase-10 (upper graph) yielded a melting 
temperature of 85.4 °C, which is 7.3 °C higher than the melting point of consensus 
phytase-1 (78.1 °C, lower graph). 

Wp nre12 : Differential scanning calorimetry (DSC) of consensus phytase-1 0-thermo-QSOT 
and consensus phytase- 1 0-thermo-Q50T-K9 1 A. The protein samples were concentrated to 
ca 50-60 mg/ml and extensively dialyzed against 10 mM sodium acetate, pH 5.0. A 
constant heating rate of 10 °C/min was applied up to 95 °C. DSC of consensus phytase-10- 
thermo-Q50T (upper graph) yielded a melting temperature of 88.6 °C, while the melting 
point of consensus phytase- 1 0-thermo-Q5 0T-K9 1 A was found at 89.3 "C. 

p-ignre 13: Comparison of the temperature optimum between consensus phytase-1, 
consensus phytase-10 and consensus phytase- 10-thermo-Q50T. For the determination of 
the temperature optimum, the phytase standard assay was performed at a series of 
temperatures between 37 and 86 °C. The diluted supernatant of transformed S. cerevisiae 
strains was used for the determination. The other components of the supernatant showed 
no influence on the determination of the temperature optimum: a, consensus phytase-1; 
^, consensus phytase-10; ■, consensus phytase 10-thermo-Q50T. 

Fipnn.14: pH-dependent activity profile and substrate specificity of consensus phytase-10 
and its variants thermo-Q50T and thermo-Q50T-K91 A. The phytase activity was 
determined using the standard assay in appropriate buffers (see Example 9) at Afferent pH- 
values. Graph a) shows the pH-dependent activity profile of consensus phytase-10 (□), 
consensus phytase-10-thermo-Q50T ( . ), and consensus phytase- 10-thermo-Q50T-K91 A 
(a) Graph b) shows the corresponding substrate specificity tested by replacement of 
phytate by the indicated compounds in the standard assay; open bars, consensus phytase-10 
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(grey bars, consensus phytase-10-thermo-Q50T; dark bars, consensus phytase- 10-thermo- 
Q50T-K91A). The numbers correspond to the following compounds: 1, phytate; 2,p- 
nitrophenyl phosphate; 3, phenyl phosphate; 4, fructose- 1,6-bisphosphate; 5, fructose-6- 
phosphate; 6, glucose-6-phosphate; 7, ribose-5-phosphate; 8, DL-glycerol-3 -phosphate; 9, 
glycerol-2-phosphate; 10, 3-phosphoglycerate; 1 1 , phosphoenolpyruvate; 12, AMP; 13, 
ADP; 14, ATP. 

Figured : pH-dependent activity profile and substrate specificity of consensus phytase-1 - 
thermo[8]-Q50T and of consensus phytase- 1 -thermo[8]-Q50T-K9 1 A. The phytase activity 
was determined using the standard assay in appropriate buffers (see Example 9) at different 
pH-values. Graph a) shows the pH-dependent activity profile of the Q50T- (■) and the 
Q50T-K91 A-variant ( . ). Graph b) shows the corresponding substrate specificities tested 
by replacement of phytate by the indicated compounds in the standard assay (open bars, 
consensus phytase- l-thermo[8]-Q50T; filled bars, consensus phytase-l-thermo[8]-Q50T- 
K91 A.). The substrates are listed in the legend of Figure 14. 

Figure 16: Differential scanning calorimetry (DSC) of consensus phytase- l-thermo[8]- 
Q50T and consensus phytase-l-thermo[8]-Q50T-K91A. The protein samples were 
concentrated to ca. 50-60 mg/ml and extensively dialyzed against 10 mM sodium acetate, 
pH 5.0. A constant heating rate of 10 °C/min was applied up to 95 °C. DSC of consensus 
phytase-l-thermo[8]-Q50T (upper graph) showed a melting temperature of 84.7 °C, while 
the melting point of consensus phytase- l-thermo[8]-Q50T-K91 A was found at 85.7 °C. 

Figure 17: Comparison of the temperature optimum between consensus phytase- 1 , 
consensus phytase- l-thermo[3] and consensus phytase- l-thermo[8]. For the determination 
of the temperature optimum, the phytase standard assay was performed at a series of 
temperatures between 37 and 86 °C. Purified protein from the supernatant of transformed 
S. cerevisiae strains was used for the determination. O, consensus phytase- 1 ; □ , 
consensus phytase-l-thermo[3]; ^ , consensus phytase l-thermo[8]. 

Figure 18: Comparison of the pH-dependent activity profile and substrate specificity of 
consensus phytase-1, consensus phytase-7, and of thephytase from A. mgerNRRL 3135. 
The phytase activity was determined using the standard assay in appropriate buffers (see 
Example 9) at different pH-values. Graph a) shows the pH-dependent activity profile of 
consensus phytase-1 (■), the phytase from A. mger NRRL 3135 (O), and of consensus 
phytase-7 (^ ). Graph b) shows the corresponding substrate specificity tested by 
replacement of phytate by the indicated compounds in the standard assay (black bars,/!. 
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niger NRRL 3135 phytase; grey bars, consensus phytase-1, dashed bars, consensus 
phytase-7). The substrates are listed in the legend of Figure 14. 

Figure 19: Differential scanning calorimetry (DSC) of the phytase from A. fumigatus 
ATCC 13073 and of its stabilized a-mutant, which contains the following amino acid 
5 exchanges F55Y, V100I, F114Y, A243L, S265P, N294D. 



The protein samples were concentrated to ca. 50-60 mg/ml and extensively dialyzed 
against 10 mM sodium acetate, pH 5.0. A constant heating rate of 10 °C/min was applied 
up to 95 °C. DSC of consensus A. fumigatus 13073 phytase (upper graph) revealed a 
melting temperature of 62.5 °C, while the melting point of the a-mutant was found at 67.0 
10 °C 



Figure 20: Comparison of the temperature optimum of A. fumigatus 13073 wild-type, its A. 
fumigatus a-mutant, and a further stabilized a-mutant (E59A-S126N-R329H-S364T- 
G404A). For the determination of the temperature optimum, the phytase standard assay 
was performed at a series of temperatures between 37 and 75 °C. The diluted supernatant 

1 5 of transformed S. cerevisiae strains was used for the determination. The other components 
of the supernatant showed no influence on the determination of the temperature optimum. 
O, A. fumigatus ATCC 13073 phytase; * , A. fumigatus ATCC 13073 a-mutant; □, A. 
fumigatus ATCC 13073 alpha-mutant-(E59A-S126N-R329H-S364T-G404A)-Q27T; ■ , 
A. fumigatus ATCC 13073 a-mutant-(E59A-S126N-R329H-S364T-G404A)-Q27T-K68A. 

20 Q27T and K68A corresponds to consensus phytase-1 Q50T and K91 A, respectively. 



Figure 21 : Amino acid sequence of consensus phytase 12 (consphyl2) which contains a 
number of active site residues transferred from the "basidio" consensus sequence to 
consensus phytase-1 0-thermo-Q50T-K9 1 A. 
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Kxample 1 : 

Qesign of the amino acid sequence of consensus phvtase-1 

Alignment of the amino acid sequences 

The alignment was calculated using the program PILEUP from the Sequence 
5 Analysis Package Release 9.0 (Devereux et al, 1984) with the standard parameter (gap 
creation penalty 12, gap extension penalty 4). The location of the gaps was refined using a 
text editor. Table 1 shows the sequences (see Figure 1) without the signal sequence that 
were used for the performance of the alignment starting with the amino acid (aa) as 
mentioned in Table 1. 

10 Table 1 : Origin and vote weight of the phvtase amino acid sequences used for the design of 
consensus phvtase-1 

-phyA from Aspergillus terreus 9A-1 , aa 27, vote weight 0.5 (Mitchell et al, 1997) 
-phyA from Aspergillus terreus cbsl 16.46, aa 27, vote weight 0.5 (van Loon et al, 1998) 

- phyA from Aspergillus niger var. awamori, aa 27, vote weight 0.33 (Piddington et al, 
15 1993) 

-phyA from Aspergillus niger T213, aa 27, vote weight 0.33 

-phyA from Aspergillus niger strain NRRL3 135, aa 27, vote weight 0.33 (van 

Hartingsveldt et al, 1993) 
-phyA from Aspergillus fumigatus ATCC 13073, aa 26, vote weight 0.2 (Pasamontes et 
20 a/., 1997) 

- phyA from Aspergillus fumigatus ATCC 32722, aa 26, vote weight 0.2 (van Loon et al, 
1998) 

-phyA from Aspergillus fumigatus ATCC 58128, aa 26, vote weight 0.2 (van Loon et al, 
1998) 

25 - phyA from Aspergillus fumigatus ATCC 26906, aa 26, vote weight 0.2 (van Loon et al, 
1998) 

-phyA from Aspergillus fumigatus ATCC 32239, aa 30, vote weight 0.2 (van Loon et al, 
1998) 

-phyA from Emericella nidulans , aa 25, vote weight 1.0 (Roche Nr. R1288, Pasamontes 
30 et al, 1997a) 

- phyA from Talaromyces thermophilus ATCC 20186, aa 24, vote weight 1.0 (Pasamontes 
et al, 1997a) 

-phyA from Myceliophthora thermophila, aa 19, vote weight 1.0 (Mitchell et al, 1997) 
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Calculation of the amino acid sequence of consensus 
phytase-1 

Using the refined alignment as input, the consensus sequence was calculated by the 
program PRETTY from the Sequence Analysis Package Release 9.0 (Devereux et al., 

5 1984). PRETTY prints sequences with their columns aligned and can display a consensus 
sequence for an alignment. A vote weight that pays regard to the similarity between the 
amino acid sequences of the phytases aligned was assigned to all sequences. The vote 
weight was set such as the combined impact of all phytases from one sequence subgroup 
(same species, but from different strains), e. g. the amino acid sequences of all phytases 

10 from A. fumigatus, on the election was set one, that means that each sequence contributes 
with a value of 1 divided by the number of strain sequences (see Table 1). By this means, it 
was possible to prevent that very similar amino acid sequences, e. g. of the phytases from 
different A. fumigatus strains, dominate the calculated consensus sequence. 

The program PRETTY was started with the following parameters: The plurality 
15 defining the number of votes below which there is no consensus was set on 2.0. The 

threshold, which determines the scoring matrix value below which an amino acid residue 
may not vote for a coalition of residues, was set on 2. PRETTY used the PrettyPep.Cmp 
consensus scoring matrix for peptides. 

Ten positions of the alignment (position 46, 66, 82, 138, 162, 236, 276, 279, 280, 
20 308; Figure 1), for which the program was not able to determine a consensus residue, were 
filled by hand according to the following rules: if a most frequent residue existed, this 
residue was chosen (138, 236, 280); if a prevalent group of similar or phylogenetically 
equivalent residues occurred, the most frequent or, if not available, one residues of this 
group was selected (46, 66, 82, 162, 276, 308). If there was either a prevalent residue nor a 
25 prevalent group, one of the occurring residues was chosen according to common 

assumption on their influence on the protein stability (279). Eight other positions (132, 
170 204, 211, 275, 317, 384, 447; Figure 1) were not filled with the amino acid residue 
selected by the program but normally with amino acids that occur with the same frequency 
as the residues that were chosen by the program. In most cases, the slight underrating of 
30 the three A. niger sequences (sum of the vote weights: 0.99) was eliminated by this 
corrections. 



Conversion of the consensus phytase-1 amino acid sequence to a DNA 
sequence 

The first 26 amino acid residues of A. terreus cbsl 16.46 phytase were used as signal 
35 peptide and, therefore, fused to the N-terminus of all consensus phytases. For this stretch, 
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we used a special method to calculate the corresponding DNA sequence. Purvis et al 
(1987) proposed that the incorporation of rare codons in a gene has an influence on the 
folding efficiency of the protein. Therefore, at least the distribution of rare codons in the 
signal sequence of A. terreus cbsl 16.46, which was used for the consensus phytase and 
5 which is very important for secretion of the protein, but converted into the S. cerevisiae 
codon usage, was transferred into the new signal sequence generated for expression in S. 
cerevisiae. For the remaining parts of the protein, we used the codon frequency table of 
highly expressed S. cerevisiae genes, obtained from the GCG program package, to translate 
the calculated amino acid sequence into a DNA sequence. 

10 The resulting sequence of the fcp gene is shown in Figure 2. 

Construction and cloning of the consensus phytase-1 gene 

The calculated DNA sequence of consensus phytase-1 (fcp) was divided into 
oligonucleotides of 85 bp, alternately using the sequence of the sense and the anti-sense 
strand. Every oligonucleotide overlaps 20 bp with its previous and its following 
15 oligonucleotide of the opposite strand. The location of all primers, purchased by 

Microsynth, Balgach (Switzerland) and obtained in a PAGE-purified form, is indicated in 
Figure 2. 



PCR-Reactions 

In three PCR reactions, the synthesized oligonucleotides were composed to the entire 
20 gene. For the PCR, the High Fidelity Kit from Boehringer Mannheim (Boehringer 
Mannheim, Mannheim, Germany) and the thermo cycler The Protokol™ from AMS 
Biotechnology (Europe) Ltd. (Lugano, Switzerland) was used. 

Oligonucleotide CP-1 to CP- 10 (Mix 1, Figure 2) were mixed to a concentration of 
0.2 pMol/jil of each oligonucleotide. A second oligonucleotide mixture (Mix 2) was 
25 prepared with CP-9 to CP-22 (0.2 pMol/u.1 of each oligonucleotide). Additionally, four 
short primers were used in the PCR reactions: 

CP-a: Eco RI 

5 '-TATAT04/4 7TCATGGGCGTGTTCGTC-3 ' 

CP-b: 

30 5 ' -TGAAAAGTTC ATTG AAGGTTTC-3 ' 

CP-c: 

5 ' -TCTTCGAAAGCAGTAC AAGTAC-3 ' 
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CP-e: Eco RI 

5 * -TATATGAA 7TCTTAAGCG AAAC-3 ' 



PCR reaction a: 



10 



PCR reaction b: 



15 



10 pi Mix 1 (2.0 pmol of each oligonucleotide) 

2 id nucleotides (10 mM each nucleotide) 

2 jil primer CP-a (10 pmol/^il) 

2 jil primer CP-c (10 pmol/jxl) 

10,0 |al PCR buffer 

0.75 jil polymerase mixture 

73.25 nlH 2 0 

10 ill Mix 2 (2.0 pmol of each oligonucleotide) 

2 ill nucleotides (10 mM each nucleotide) 

2 \il primer CP-b (10 pmol/j±l) 

2 jil primer CP-e (10 pmol/jal) 

10,0 [il PCR buffer 

0.75 jil polymerase mixture (2.6 U) 

73.25 >il H 2 Q 



20 



Reaction conditions for PCR reaction a and b: 

step 1 2 min - 45°C 

step 2 30 sec - 72°C 

step 3 30 sec - 94°C 

step 4 30 sec - 52°C 

step 5 1 min - 72°C 

Step 3 to 5 were repeated 40-times. 



25 The PCR products (670 and 905 bp) were purified by an agarose gel electrophoresis 

(0.9% agarose) and a following gel extraction (QIAEX II Gel Extraction Kit, Qiagen, 
Hilden, Germany). The purified DNA fragments were used for the PCR reaction c. 



PCR reaction c: 6 |il PCR product of reaction a («50 ng) 

6 nl PCR product of reaction b («50 ng) 
30 2 (il primer CP-a (10 pmol/jj.1) 

2 nl primer CP-e (10 pmol/fil) 
10,0 |il PCR buffer 
0.75 nl polymerase mixture (2.6 U) 
73.25 nl H 2 0 



35 Reaction conditions for PCR reaction c: 
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step 1 2 min - 94°C 

step 2 30 sec - 94°C 

step 3 30 sec - 55°C 

step 4 1 min - 72°C 

5 Step 2 to 4 were repeated 31 -times. 

The resulting PCR product (1.4 kb) was purified as mentioned above, digested with 
Eco RI, and ligated in an Eco Rl-digested and dephosphorylated pBsk(-)-vector 
(Stratagene, La Jolla, CA, USA). 1 pi of the ligation mixture was used to transform E, coli 
XL-1 competent cells (Stratagene, La Jolla, CA, USA). All standard procedures were 
10 carried out as described by Sambrook et ai. (1987). The DNA sequence of the constructed 
consensus phytase gene (fcp, Figure 2) was controlled by sequencing as known in the art. 

Example 2 

Design of an improved consensus phytase (consensus phytase-10^ amino acid 
sequence 

15 The alignments used for the design of consensus phytase- 10 were calculated using 

the program PILEUP from the Sequence Analysis Package Release 9.0 (Devereux et aL 9 
1984) with the standard parameter (gap creation penalty 12, gap extension penalty 4). The 
location of the gaps was refined using a text editor. 

The following sequences were used for the alignment of the Basiodiomycetes phytases 
20 starting with the amino acid (aa) mentioned in Table 2: 

Table 2: Origin and vote weight of five Basidiomycetes phytases used for the calculation of 
the corresponding amino acid consensus sequence fbasidio^ 

-phyAl from Paxillus involutus NN005693, aa 21, vote weight 0.5 (WO 98/28409) 
-phyA2 from Paxillus involutus NN005693, aa 21, vote weight 0.5 (WO 98/28409) 
25 - phyA from Trametes pubescens NN9343, aa 24, vote weight 1 .0 (WO 98/28409) 
-phyA from Agrocybe pediades NN009289, aa 19, vote weight 1.0 (WO 98/28409) 
-phyA from Peniophora lycii NN006113, aa 21, vote weight 1.0 (WO 98/28409) 

The alignment is shown in Figure 3. 

In Table 3 the genes, which were used for the performance of (he final alignment, are 
30 arranged. The first amino acid (aa) of the sequence which is used in the alignment is 
mentioned behind the organism designation. 
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Tahle 3: Origin and vote weight of the phytase se quences used for the design of consensus 
phytase 10 

-phyA from Aspergillus terreus 9A-1, aa 27, vote weight 0.5 (Mitchell et a/., 1997) 
-phyA from Aspergillus terreus cbsl 16.46, aa 27, vote weight 0.5 (van Loon et al, 1998) 
5 - phyA from Aspergillus niger var. awamori, aa 27, vote weight 0.5 (Piddington et al, 
1993) 

-phyA from Aspergillus niger strain NRRL3135, aa 27, vote weight 0.5 (van Hartingsveldt 
etal, 1993) 

-phyA from Aspergillus fumigatus ATCC 13073, aa 26, vote weight 0.2 (Pasamontes et 
10 ah y 1997) 

-phyA from Aspergillus fumigatus ATCC 32722, aa 26, vote weight 0.2 (van Loon et al, 
1998) 

- phyA from Aspergillus fumigatus ATCC 58128, aa 26, vote weight 0.2 (van Loon et al, 
1998) 

15 -phyA from Aspergillus fumigatus ATCC 26906, aa 26, vote weight 0.2 (van Loon et ah, 
1998) 

-phyA from Aspergillus fumigatus ATCC 32239, aa 30, vote weight 0.2 (van Loon et ah, 
1998) 

-phyA from Emericella nidulans , aa 25, vote weight 1.0 (Roche Nr. R1288, Pasamontes 
20 etal, 1997a) 

-phyA from Talaromyces thermophilus ATCC 20186, aa 24, vote weight 1.0 (Pasamontes 
etal, 1997a) 

-phyA from Myceliophthora thermophila, aa 19, vote weight 1 .0 (Mitchell et a!. 9 1997) 
-phyA from Thermomyces lanuginosa, aa 36, vote weight 1.0 (Berka et al> 1998) 
25 - Consensus sequence of five Basidiomycetes phytases, vote weight 1 .0 (Basidio, Figure 3) 

The corresponding alignment is shown in Figure 4. 

Calculation of the amino acid sequence of consensus-10 

To improve the alignment, we added the original consensus sequence of five 
phytases from four different Basidiomycetes, called Basidio, still containing the undefined 

30 sequence positions (see Figure 3), nearly all phytase sequences used for calculation of the 
original consensus phytase and one new phytase sequence from the Ascomycete 
Thermomyces lanuginosa to a larger alignment. Using the consensus sequence of the 
basidiomycetal phytase sequences, does not pay regard to the diversity among the five 
amino acid sequences, but pays regard to the common and different amino acid residues 

35 between the phytases from the Ascomycetes and the Basidiomycetes. 
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We set plurality on 2.0 and threshold on 3. The used vote weight are listed in Table 3. The 
alignment and the corresponding consensus sequence is presented in Figure 4. The new 
consensus phytase sequence has 32 different amino acids in comparison to the original 
consensus phytase. Positions for which the program PRETTY was not able to calculate a 
consensus amino acid residue were filled according to rules mentioned in Example 1 . None 
of the residues suggested by the program was replaced. 

Furthermore, we included all Basidiomycetes phytases as single amino acid sequences but 
assigning a vote weight of 0.2 in the alignment. The corresponding alignment is shown in 
Figure 6. The calculated consensus amino acid sequence (consensus phytase- 1 1) has the 
following differences to the sequence of consensus phytase-10. Letter X means that the 
program was not able to calculate a consensus amino acid; the amino acid in parenthesis 
corresponds to the amino acid finally included into the consensus phytase-10. 

D35X, X(K)69K, X(E)100E, A101R, Q134N, X(K)153N, X(H)190H, X(A)204S, 
X(E)220D, E222T, V227A, X(R)271R, H287A, X(D)288D, X(K)379K, X(I)389I, E390X, 
X(E)415E, X(A)416A, X(R)446L, E463A, whereas the numbering is as in Fig. 5. 

We also checked single amino acid replacements suggested by the improved consensus 
sequences 10 and 1 1 on their influence on the stability of the original consensus phytase. 
The approach is described in example 3. 

Conversion of consensus phytase-10 amino acid sequence to a DNA 
sequence 

The first 26 amino acid residues of A. terreus cbsl 16.46 phytase were used as signal 
peptide and, therefore, fused to the A^-terminus of consensus phytase-10. The used 
procedure is further described in Example 1 . 

The resulting sequence of the fcplO gene is shown in Figure 5. 



Construction and cloning of the consensus phytase-10 gene (fcplO) 

The calculated DNA sequence of/cplO was divided into oligonucleotides of 85 bp, 
alternately using the sequence of the sense and the anti-sense strand. Every oligonucleotide 
overlaps 20 bp with its previous and its following oligonucleotide of the opposite strand. 
The location of all primers, purchased by Microsynth, Balgach (Switzerland) and obtained 
in a PAGE-purified form, is indicated in Figure 5. 
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PCR-Reactions 

In three PCR reactions, the synthesized oligonucleotides were composed to the entire 
gene. For the PCR, the High Fidelity Kit from Boehringer Mannheim (Boehringer 
Mannheim, Mannheim, Germany) and the thermo cycler The Protokol from AMS 
5 Biotechnology (Europe) Ltd. (Lugano, Switzerland) was used. The following 
oligonucleotides were used in a concentration of 0.2 pMol/ml. 



Mix 110: CP-1, CP-2, CP-3.10, CP-4.10, CP-5.10, CP-6, CP-7.10, CP-8.10, CP-9.10, CP- 



Mix 2.10: CP-9.10, CP-1 1.10, CP-12.10, CP-13.10, CP-14.10, CP-15.10, CP-16.10, CP- 
17.10, CP18.10, CP-19.10, CP-20.10, CP-21.10, CP-22.10 

The newly synthesized oligonucleotides are marked by number 10. The phytase contains 
the following 32 exchanges, which are underlined in Figure 5, in comparison to the 
original consensus phytase: Y54F, E58A, D69K, D70G, A94K, N134Q, I158V, S187A, 
Q188N, D197N, S204A, T214L, D220E, L234V, A238P, D246H, T251N, Y259N, 
E267D, E277Q, A283D, R291I, A320V, R329H, S364T, I366V, A379K, S396A, G404A, 
Q415E, A437G, A463E. 

Four short PCR primer were used for the assembling of the oligonucleotides: 

CP-a: Eco RI 

5'-TATATGX47TCATGGGCGTGTTCGTC-3' 



10.10 



20 CP-b: 



5 '-TG AAAAGTTC ATTGAAGGTTTC-3 ' 



CP-c.10: 



25 



CP-e: 



5 ' -TCTTCG AAAGC AGT AC AC AAAC-3 ' 
Eco HI 

5 ' -TAT AT GAA 7TCTTAAGCGAAAC-3 ' 



30 



PCR reaction a: 



10 ul Mix 1.10 (2.0 pmol of each oligonucleotide) 

2 ul nucleotides (10 mM each nucleotide) 

2 ul primer CP-a (10 pmol/ml) 

2 ul primer CP-c.10 (10 pmol/ml) 

10,0 ul PCR buffer 

0.75 ul polymerase mixture 

73.25 ul H2O 
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PCR reaction b\ 1 0 \x\ Mix 2. 1 0 (2.0 pmol of each oligonucleotide) 

2 ill nucleotides (10 mM each nucleotide) 
2 (^1 primer CP-b (10 pmol/ml) 
2 til primer CP-e (10 pmol/ml) 
5 10,0 nlPCR buffer 

0.75 fj.1 polymerase mixture (2.6 U) 
73.25 nl H2O 

Reaction conditions for PCR reaction a and b: 



step 1 


2 min - 


45 °C 


step 2 


30 sec 


-72 °C 


step 3 


30 sec 


-94 °C 


step 4 


30 sec 


-52 °C 


step 5 


1 min - 


72 °C 



Step 3 to 5 were repeated 40- times. 

15 The PCR products (670 and 905 bp) were purified by an agarose gel electrophoresis 

(0.9% agarose) and a following gel extraction (QIAEX II Gel Extraction Kit, Qiagen, 
Hilden, Germany). The purified DNA fragments were used for the PCR reaction c. 

PCR reaction c: 6 p\ PCR product of reaction a »50 ng) 

6 PCR product of reaction b «50 ng) 
20 2 jal primer CP-a (10 pmol/ml) 

2 yil primer CP-e (10 pmol/ml) 
10,0 ^1 PCR buffer 
0.75 fj.1 polymerase mixture (2.6 U) 
73.25 yil H2O 

25 Reaction conditions for PCR reaction c: 

step 1 2 min - 94 °C 

step 2 30 sec - 94 °C 

step 3 30 sec - 55 °C 

step 4 1 min - 72 °C 

30 Step 2 to 4 were repeated 3 1 -times. 

The resulting PCR product (1 .4 kb) was purified as mentioned above, digested with 
Eco RI, and ligated in an Eco Rl-digested and dephosphorylated pBsk(-)-vector 
(Stratagene, La Jolla, CA, USA). 1 \x\ of the ligation mixture was used to transform E. coli 
XL-1 competent cells (Stratagene, La Jolla, CA, USA). All standard procedures were 
35 carried out as described by Sambrook et ah (1987). The DNA sequence of the constructed 
gene (fcpJO) was checked by sequencing as known in the art. 
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Exampte 3 

Increasing the thermostability of consensus phvtase-1 bv introduction of single 
mutations suggested by the amino acid sequence of consensus phytase- 10 and 
consensus phvtase-1 1 

5 In order to increase the thermostability of homologous genes, it is also possible to 

test the stability effect of each differing amino acid residue between the protein of interest 
and the calculated consensus sequence and to combine all stabilizing mutations into the 
protein of interest. We used the consensus phytase as protein of interest and tested the 
effect on the protein stability of 34 amino acid residues, differing to consensus phytase 10 
10 and/or 1 1 as single mutations. 

To construct muteins for expression in A. niger, S. cerevisiae, or K polymorphs the 
corresponding expression plasmid containing the consensus phytase gene was used as 
template for site-directed mutagenesis (see Example 6-8). Mutations were introduced using 
the "quick exchange™ site-directed mutagenesis kit" from Stratagene ( La Jolla, CA, 
15 USA) following the manufacturer's protocol and using the corresponding primers. All 
mutations made and their corresponding primers are summarized in Table 4. Plasmids 
harboring the desired mutation were identified by DNA sequence analysis as known in the 
art. 



Table 4: Primers used for site-directed mutagenesis of consensus phytase 

20 (Exchanged bases are highlighted in bold. The introduction of a restriction site is marked 
above the sequence. When a restriction site is written in parenthesis, the mentioned site 
was destroyed by introduction of the mutation.) 



mutation Primer set 

25 Kpn I 

Q50T 5 * -C ACTTGTGGGG7M CCT ACTCTCC AT ACTTCTC-3 * 

5 * -GAG AAGT ATGG AGAGT AGG7M CCCC AC AAGTG-3 ' 



Y54F 5' -GGTCAAT ACTCTCC ATTCTTCTCTTTGGAAG-3' 

30 S'-CTTCC AAAGAGAAGAATGGAGAGTATTGACC-3 ' 



E58A 5'-CATACTTCTCTTTGGCAGACGAATCTGC-3* 
5 '-GC AG ATTCGTCTGCC AAAGAGAAGT ATG-3 ' 



28 



D69K 



5 D70G 



AatU 



5'-CTCCAG^CGrCCCAAAGGACTGTAGAGTTAC-3' 
5' -GT AACTCT AC AGTCCTTTGGG/1 CGrCTGG AG-3 ' 



Aatn 



5'-CTCCAG/lCGrCCCAGACGGCTGTAGAGTTAC-3' 
5'-GTAACTCTACAGCCGTCTGGC«4CGrCTGGAG-3' 



10 



K91A 



A94K 



5 '-GATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTG-3 ' 
5 * -CAGAGTAAGCCTTAGACGCAGAAGAAGTTGGGTATC-3 ' 

Seal 

5 ' -CTTCT AAGTCT AAG A/< GTA C7CTGCTTTG-3 ' 
5 ' -C AAAGC AG-4 G7MCTTCTTAGACTT AG AAG-3 ' 



15 



A 1 o 1 R 5 * -GCTT ACTCTGCTTTG ATTG AACG G ATTC AAAAG AACGCT AC-3 ' 
5 ' -GTAGCGTTCTTTTGAATCCGTTC AATC AAAGC AGAGT AAGC-3 ' 



N134Q 



20 K153N 



I158V 



25 



30 



D197N 



S187A 



T214L 



35 E222T 



5 ' -CC ATTCGGTG AACAGC AAATGGTT AACTC-3 ' 
5 ' -G AGTT AACC ATTTGCTGTTC ACCGAATGG-3 ' 

Nrul 

5 ' -G AT AC AAGGCTC rCGCG/4 GAAACATTGTTC -3 ' 
5 ' -GGAAC AATGTTTC rCGCG/lG AGCCTTGTATC-3 ' 

5 ' -GATTGTTCC ATTCGTGCGCGCTTCTGGTTC-3 ' 
5 '-GAACC AGAAGCGCGCACGAATGGAACAATC-3 ' 

Bell 

5 '-CTCCAGTTATTAACGrG/4 rC4TTCCAGAAGG-3 * 
5 ' -CCTTCTGGAArG4 TCA CGTTAATAACTGGAG-3 ' 

Apa I 

5'-GGCTGACCCAGGGGCCCAACCACACCAAGC-3' 
5 '-GCTTGGTGTGGTTGGGCCCCTGGGTC AGCC-3 ' 

Ncol 

5*-CACTTTGGACC4rGGTCTTTGTACTGCTTTCG-3' 
5 ' -CGAAAGC AGTAC AAAGACC/4 rGGTCC AAAGTG-3 ' 

Avrll 

5'-GCTTTCGAAGACTCTACCCL4GGTGACGACGTTG-3' 
5 ' -CAACGTCGTC ACC7M GGGT AGAGTCTTCG AAAGC-3 ' 
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V227A 5'-GGTGACGACGCTGAAGCTAACTTCAC-3' 
5 '-GTGAAGTTAGCTTCAGCGTCGTCACC-3 ' 

Sac n 

5 L234V 5'-CTAACTTCACCGCGGTGTTCGCTCCAG-3' 
5 '-CTGGAGCGAACACCGCGGTGAAGTTAG-3 ' 



A238P 5 ' -GCTTTGTTCGCTCC ACCTATT AGAGCTAG ATTGG-3 ' 
5 '-CC AATCTAGCTCTAATAGGTGGAGCGAACAAAGC-3 ' 

10 Hpal 

T251N 5 '-GCCAGGTG7TyL4CTTGACTGACGAAG-3 ' 
5 '-TTCGTCAGTCAAGTT/L* CACCTGGC-3' 

AatU 

' Y259N 5'-GACGAAG^CGrCGTTAACTTGATGGAC-3* 
15 5'-GTCCATCAAGTTAAC04CG7CITCGTC-3' 

Asp I 

E267D 5 '-GTCCATTCGi4C4C7*G7CGCTAGAACTT C-3 ' 
5'-GAAGTTCTAGCG/l CA GTGTCGAATGGAC-3 » 



20 E277Q 5 '-CTGACGCTACTCAGCTGTCTCC ATTC-3 ' 
5 '-GAATGGAGACAGCTGAGTAGCGTC AG-3' 



25 



30 



A283D 



H287A 



R291I 



Q292A 



35 A320V 



R329H 



5 '-GTCTCCATTCTGTGATTTGTTC ACTCAC-3 ' 
5 ' -GTGAGTGAACAAATC ACAG AATGG AG AC-3 * 

Ksp I 

5 ' -GCTTTGTTC ACCG CGGACG AATGG AG-3 ' 
5'-CTCCATTCGTCCGCGGTGAACAAAGC-3' 

Bam HI 

5 '-C ACGACGAATGG/4 rCCAATACGACTAC-3 ' 
5 '-GTAGTCGTATTGG/1 rCCATTCGTCGTG-3 ' 

Bsi ' WI 

5 '-GACGAATGGAGAGCG7/* CGACTACTTG-3 ' 
5 ' -C AAGT AGTCG7M CGCTCTCC ATTCGTC-3 ' 

Hpal 

5 '-GGTGTTGGTTTCG7T/L4 CGAATTGATTGC-3 ' 
5'-GCAATCAATTCG7X4y<CGAAACCAACACC-3' 

(Bgin) 

5 ' -GCTAGATTG ACTC4 CrC7CC AGTTC AAG-3 ' 
5 ' -CTTGAACTGGAGAGJG/1 G7C AATCT AGC-3 ' 



-30- 



EcoRV 

S364T 5 ' -CTC ACG AC AAC ACT AT GA TA rCT ATTTTCTTC- 3 ' 
5 ' -G AAG AAAAT AG4 TA rCAT AGTGTTGTCGTGAG-3 ' 

Ncol 

1366V 5'-CGACAACTCC4rGGTTTCTATTTTCTTCGC-3' 

5 ' -GCG AAGAAAATAGAAACG4 rGGAGTTGTCG-3 ' 

Kpnl 

A379K 5'-GTACAACGG7>JCCAAGCCATTGTCTAC-3' 
5'-GTAGACAATGGCTTGGZ4CCGTTGTAC-3' 



S396A 5'-CTGACGGTTACGCTGCTTCTTGGAC-3' 
5 ' -GTCC AAGAAGCAGCGT AACCGTC AG-3 ' 

G404A 5 ' -CTGTTCC ATTCGCTGCTAGAGCTTAC-3 ' 
5 ' -GTAAGCTCTAGCAGCGAATGG AAC AG-3 ' 

0415E 5 ' -GATGC AATGTG AAGCTGAAAAGG AACC-3 ' 
5 ' -GGTTCCTTTTC AGCTTC AC ATTGCATC-3 ' 

Sail 

A437G 5 ' -C ACGGTTGTGGTGrCG/lCAAGTTGGG-3' 
5'-CCCAACTTGrC04CACCACAACCGTG-3' 

Mun I 

A463E 5 ' -G ATCTGGTGGC/L4 7TGGGAGGAATGTTTCG-3' 
5'-CGAAACATTCCTCCC,4/<7TGCCACCAGATC-3' 

and accordingly for other mutations. 



The temperature optimum of the purified phytases, expressed in Saccharomyces cerevisiae 
(Example 7), was determined as outlined in Example 9. Table 5 shows the effect on the 
stability of consensus phytase for each mutation introduced. 

t .m« s- stability *»f*-rt nf the m *«M™\ amino arid raMac.Hnents in r*>n*fflimw phvtase-1 
(+ or - means a positive, respectively, negative effect on the protein stability up to 1 °C, ++ 
and - means a positive, respectively, negative effect on the protein stability between 1 and 
3 °C; the number 10 or 1 1 corresponds to the consensus phytase sequence that suggests the 
amino acid replacement.) 
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^taWilizino: 




neutral 




destabilizing 


mutation 


effect 


mutation 


effect 


mutation effect 


E58A(10) 


+ 


D69A 


± 


Y54F (10) 


D69K(11) 


+ 


D70G (10) 


± 


V73I 


D197N(10) 


+ 


N134Q(10) 


± 


A94K (10) 


T214L(10) 


+ + 


G186H 


± 


AlOlR(ll) 


E222T(11) 


+ + 


S187A(10) 


± 


K153N(11) 


E267D (10) 


+ 


T214V 


± 


I158V(10) 


R291I* 


+ 


T251N(10) 


± 


G203A 


R329H (10) 


+ 


Y259N(10) 


± 


G205S 


S364T (10) 


+ + 


A283D(10) 


± 


A217V 


A379K Tin 


+ 


A320V (10) 


± 


V227A (11) 




+ + 


K445T 


± 


L234V (10) 






A463E (10) 


± 


A238P (10) 










E277Q (10) 










H287A(11) 










Q292A(10) 










I366V (10) 










S396A(10) 










Q415E(U) 










A437G(10) 










E451R 



*: This amino acid replacement was found in another round of mutations. 
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We combined eight positive mutations (E58A, D197N, E267D, R291I, R329H, S364T, 
A379K, G404A) in the consensus phytase using the primers and the technique mentioned 
above in this example. Furthermore, the mutations Q50T and K91 A were introduced which 
mainly influence the catalytical characteristics of the phytase (see patent application EP 
5 97810175.6 and EP 971 12688 as well as Example 9). The DNA and amino acid sequence 
of the resulting phytase gene (consensus phytase-thermo[8]-Q50T-K91 A) is shown in 
Figure 7. In this way, the temperature optimum and the melting point of the consensus 
phytase was increased by 7 °C (Figure 15, 16, 17). 

Using the results of Table 5, we further improved the thermostability of consensus phytase 
10 10 by the following back mutations K94A, V158I, and A396S that revealed a strong 

negative influence on the stability of consensus phytase. The resulting protein is phytase- 
10-thermo [3]. Furthermore, we introduced the mutations Q50T and K91 A which mainly 
influence the catalytical characteristics of consensus phytase (see patent application EP 
97810175.6 and EP 971 12688 as well as Example 9 and Figure 14 and 15). The resulting 
15 DNA and amino acid sequence is shown in Figure 8. The optimized phytase showed a 4 °C 
higher temperature optimum and melting point than consensus phytase 10 (Figure 12 and 
13). Furthermore, the phytase has also a strongly increased specific activity withphytate as 
substrate of 250 U/mg at pH 5.5 (Figure 14). 

Example 4 

20 Stabilization of the phvtase of A. fumigatus ATCC 13073 by replace ment of amino 

acid resid ues with the corresponding consensus phytase- 1 and consensus phytase- 

At six typical positions where the A. fumigatus 1 3073 is the only or nearly the only 
phytase in the alignment of Figure 1 that does not contain the corresponding consensus 
25 phytase amino acid residue, the non-consensus amino acid residue was replaced by the 

consensus one. In a first round, the following amino acids were substituted in A. fumigatus 
13073 phytase, containing the Q27T substitution and the signal sequence of A, terreus 
cbs.l 16.46 phytase (see European Patent Application No. 97810175.6 and Figure 9): 

F55(28)Y, V100(73)I, Fl 14(87)Y, A243(220)L, S265(242)P, N294(282)D. 

30 The numbers in parentheses confer to the numbering of Figure 1 . 

In a second round, four of the seven stabilizing amino acid exchanges (E59A, R329H, 
S364T, G404A) found in the consensus phytase-10 sequence and, tested as single mutation 
in consensus phytase- 1 (Table 5), were additionally introduced into the A. fumigatus a- 
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mutant. Furthermore, the amino acid replacement S126N, shown to reduce the protease 
susceptibility of the phytase, was introduced. 

The mutations were introduced as described in example 3 (see Table 6) and expressed as 
described in example 6 to 8. The resulting A. fumigatus 13073 phytase variants were called 
5 a-mutant and fJ-mutant-E59A-S126N-R329H-S364T-G404A. 

The temperature optimum (60 °C, Figure 20) and the melting point (67.0 °C, Figure 19) of 
the A. fumigatus 13073 phytase 15-mutant was increased by 5 °C in comparison to the 
values of the wild-type (temperature optimum: 55 °C, T m : 60 °C). The five additional 
amino acid replacements further increased the temperature optimum by 3 °C (Figure 20). 

10 Table 6: Mutagenesis primers for s tabilization of A. fumigatus phvtase ATCC 13073 
Mutation Primer 

F55Y 5 ' -CACGTACTCGCC ATACTTTTCGCTCGAG-3 ' 



5 '-CTCG AGCGAAAAGTATGGCG AGTACGTG-3 ' 



15 E58A 



(XhoY) 

5*-CCATACTTITCGCrCGCGGACGAGCTGTCCGTG-3' 
5 '-C ACGGAC AGCTCGTCCG CG/1 GCGAAAAGT AGG-3 ' 



VI 001 



5'-GTATAAGAAGCTTATTACGGCGATCCAGGCC-3' 
5 ' -GGCCTGGATCGCCGT AAT AAGCTTCTTATAC-3 ' 



20 



F114Y 



5 '-CTTCAAGGGCAAGTACGCCTTTTTGAAGACG-3' 
5 '-CGTCTTCAAAAAGGCGTACTTGCCCTTGAAG-3* 



25 



A243L 



5 '-CATCCGAGCTCGCCTCGAGAAGCATCTTC-3 ' 
5 '-GAAGATGCTTCTCGAGGCGAGCTCGGATG-3 ' 



S265P 



5 '-CTAATGGA TGTGTCCGTTTGATACGGTAG-3 ' 
5 ' -CTACCGTATC AAACGGAC AC ATGTCC ATT AG-3 ' 
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N294D 5 ' -GTGGAAGAAGTACG ACTACCTTCAGTC-3 ' 
5 '-GACTGAAGGTAGTCGTACTTCTTCCAC-3 ' 

(Mlu I) 

5 R329H 5'-GCCCGGTTG^CGC4JTCGCCAGTGCAGG-3' 
5'-CCTGCACTGGCGA^rGCG7TCAACCGGGC-3' 



10 



Ncol 

S364T S'-CACACGACAACACC^rGGTTTCCATCTTC-S' 
5 ' -GAAG ATGGAAACCAIQfiTGTTGTCGTGTG-3 ' 

(Bss HI) 

G404A 5 '-GTGGTGCCTTTCGCCGCGCGAGCCTACTTC-3 ' 
5'-GAAGTAGGCTCGCGCG<7CGAAAGGCACCAC-3' 



Example 5 

Introduct ion of the active site amino acid residues of the A. niger NRRL 3 1 35 
15 phytase into the consensus phvtase-1 

We used the crystal structure of the Aspergillus niger NRRL 3 135 phytase to define 
all active site amino acid residues (see Reference Example and EP 97810175.6). Using the 
alignment of Figure 1, we replaced the following active site residues and additionally the 
not identical adjacent ones of the consensus phytase by that of the .4. niger phytase: 

20 S89D, S92G, A94K, D164S, P201S, G203A, G205S, H212P, G224A, D226T, E255T, 

D256E, V258T, P265S, Q292H, G300K, Y305H, A314T, S364G, M365I, A397S, S398A, 
G404A, and A405S 

The new protein sequence consensus phytase -7 was backtranslated into a DNA sequence 
(Figure 10) as described in Example 1. The corresponding gene (fcp7) was generated as 
25 described in Example 1 using the following oligonucleotide mixes: 

Mix 1.7: CP-1, CP-2, CP-3, CP-4.7, CP-5.7, CP-6, CP-7, CP-8.7, CP-9, CP-10.7 

Mix 2.7: CP-9, CP-10.7, CP-1 1.7, CP-12.7, CP-13.7, CP-14.7, CP-15.7, CP-16, CP-17.7, 
CP-18.7, CP-19.7, CP-20, CP-21, CP-22. 

The DNA sequences of the oligonucleotides are indicated in Figure 3. The newly 
30 synthesized oligonucleotides are additionally marked by number 7. After assembling of the 
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oligonucleotides using the same PCR primers as mentioned in Example 1, the gene was 
cloned into an expression vector as described in Examples 6-8. 

The pH-profile determined after expression in H, polymorpha and purification was shifted 
into the acidic range of the pH-spectrum showing an optimum at pH 4.5-5.0 (see Figure 
5 18). The enzyme had a broad pH-optimum reaching at least 60% of its maximum activity 
from pH 2.5 to pH 6.0. Up to pH 5.0, the profile resembled the profile of the A. niger 
NRRL 3135 phytase. However, below pH 5.0 it lacked the typical low at pH 4.0 of the 
profile of A. niger phytase. 

Fxample 6 

10 Fxpression of the con s ensus phvtase genes in Hansenula volvmorvha 

The phytase expression vectors, used to transform K polymorpha RBI 1 (Gellissen et 
al. 9 1994), was constructed by inserting the Eco RI fragment of pBskr/cp or variants 
thereof into the multiple cloning site of the H. polymorpha expression vector pFPMT121, 
which is based on an ura3 selection marker from S. cerevisiae, a formate dehydrogenase 
15 (FMD) promoter element and a methanol oxidase (MO) termimator element from H. 
polymorpha. The 5' end of the fcp gene is fused to the FMD promoter, the 3' end to the 
MOT terminator (Gellissen et ah, 1996; EP 0299 108 B). The resulting expression vector 
are designated pFPMT/c/>, P FPMT/cp70, pFPMT/c/?7. 

The constructed plasmids were propagated in E. colL Plasmid DNA was purified 

20 using standard state of the art procedures. The expression plasmids were transformed into 
the H. polymorpha strain RP1 1 deficient in orotidine-5* -phosphate decarboxylase (ura3) 
using the procedure for preparation of competent cells and for transformation of yeast as 
described in Gelissen et al. (1996). Each transformation mixture was plated on YNB 
(0.14% w/v Difco YNB and 0.5% ammonium sulfate) containing 2% glucose and 1.8% 

25 agar and incubated at 37 °C. After 4 to 5 days individual transformant colonies were picked 
and grown in the liquid medium described above for 2 days at 37 °C. Subsequently, an 
aliquot of this culture was used to inoculate fresh vials with YNB-medium containing 2% 
glucose. After seven further passages in selective medium, the expression vector integrates 
into the yeast genome in multimeric form. Subsequently, mitotically stable transformants 

30 were obtained by two additional cultivation steps in 3 ml non-selective liquid medium 
(YPD, 2% glucose, 10 g yeast extract, and 20 g peptone). In order to obtain genetically 
homogeneous recombinant strains an aliquot from the last stabilization culture was plated 
on a selective plate. Single colonies were isolated for analysis of phytase expression in 
YNB containing 2% glycerol instead of glucose to derepress the fmd promoter. Purification 

35 of the consensus phytases was done as described in Example 7. 
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Example 7 

Expression Of the consensus phvtase genes i n Saccharnmvces cerevisiae and 
purification of the phvtases from cult ure sup ernatant 

The consensus phytase genes were isolated from the corresponding Bluescript- 
plasmid (pBsk/c/?, pBSK/cp/0, pBsk'fcp7) and ligated into the Eco RI sites of the 
expression cassette of the Saccharomyces cerevisiae expression vector pYES2 (Invitrogen, 
San Diego, CA, USA) or subcloned between the shortened GAPFL (glyceraldhyde-3- 
phosphate dehydrogenase) promoter and the phoS terminator as described by Janes et al. 
(1990). The correct orientation of the gene was checked by PCR. Transformation of S. 
cerevisiae strains, e. g. INVScl (Invitrogen, San Diego, CA, USA) was done according to 
Hirmen et al (1978). Single colonies harboring the phytase gene under the control of the 
GAPFL promoter were picked and cultivated in 5 ml selection medium (SD-uracil, 
Sherman et al. y 1986) at 30°C under vigorous shaking (250 rpm) for one day. The 
preculture was then added to 500 ml YPD medium (Sherman et aL 9 1986) and grown 
under the same conditions. Induction of the gall promoter was done according to 
manufacturer's instruction. After four days of incubation cell broth was centrifuged (7000 
rpm, GS3 rotor, 15 min, 5°C) to remove the cells and the supernatant was concentrated by 
way of ultrafiltration in Amicon 8400 cells (PM30 membranes) and ultrafree-15 
centrifugal filter devices (Biomax-30K, Millipore, Bedford, MA, USA). The concentrate 
(10 ml) was desalted on a 40 ml Sephadex G25 Superfine column (Pharmacia Biotech, 
Freiburg, Germany), with 10 mM sodium acetate, pH 5.0, serving as elution buffer. The 
desalted sample was brought to 2 M (NH^SCX* and directly loaded onto a 1 ml Butyl 
Sepharose 4 Fast Flow hydrophobic interaction chromatography column (Pharmacia 
Biotech, Feiburg, Germany) which was eluted with a linear gradient from 2 M to 0 M 
(NH4) 2 S0 4 in 10 mM sodium acetate, pH 5.0. Phytase was eluted in the break-through, 
concentrated and loaded on a 120 ml Sephacryl S-300 gel permeation chromatography 
column (Pharmacia Biotech, Freiburg, Germany). Consensus phytase and consensus 
phytase -7 eluted as a homogeneous symmetrical peak and was shown by SDS-PAGE to be 
approx. 95% pure. 

Example 8 

Expression of the consensus phytase genes in Aspergillus niver 

The Bluescript-plasmids pBsk'/cp, pBSK'fcplO, and pBsk'fcp7 were used as template 
for the introduction of a Bsp Hi-site upstream of the start codon of the genes and an Eco 
RV-site downstream of the stop codon. The Expand™ High Fidelity PCR Kit (Boehringer 
Mannheim, Mannheim, Germany) was used with the following primers: 
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Primer Asp-1: 

Bsp HI 

S'-TATArC^rG^GCGTGTTCGTCGTGCTACTGTTC-S' 

Primer Asp-2 used for cloning of fcp and fcpl\ 
5 Eco RV 

3'-ACCCGACTT AC AAAGCGAATTCZ4 TA GATATAT-5' 

Primer Asp-3 used for cloning of fcp] 0: 

EcoKV 

3 ' - ACCCTTCTT AC AAAGCGAATTC7/1 W GAT AT AT- 5 ' 

10 The reaction was performed as described by the supplier. The PCR-amplified fcp- 

genes had a new Bsp HI site at the start codon, introduced by primer Asp-1, which resulted 
in a replacement of the second amino acid residue glycine by serine. Subsequently, the 
DNA-fragment was digested with Bsp HI and Eco RV and ligated into the Nco I site 
downstream of the glucoamylase promoter of Aspergillus niger (glaA) and the Eco RV site 

15 upstream of the Aspergillus nidulans tryptophan C terminator (trpC) (Mullaney et al. 9 
1985). After this cloning step, the genes were sequenced to detect possible failures 
introduced by PCR. The resulting expression plasmids which basically corresponds to the 
pGLAC vector as described in Example 9 of EP 684 313, contained the orotidine-5'- 
phosphate decarboxylase gene (pyr4) of Neurospora crassa as a selection marker. 

20 Transformation of Aspergillus niger and expression of the consensus phytase genes was 
done as described in EP 684 313. The consensus phytases were purified as described in 
Example 7. 



Example 9 

Determination of phy tase activity and of temperature optimum 

25 Phytase activity was determined basically as described by Mitchell et al (1997). The 

activity was measured in an assay mixture containing 0.5% phytic acid («5 mM) in 200 
mM sodium acetate, pH 5.0. After 15 min of incubation at 37 °C, the reaction was stopped 
by addition of an equal volume of 15% trichloroacetic acid. The liberated phosphate was 
quantified by mixing 100 ^1 of the assay mixture with 900 fil H2O and 1 ml of 0.6 M 

30 H2SO4, 2% ascorbic acid and 0.5% ammonium molybdate. Standard solutions of 

potassium phosphate were used as reference. One unit of enzyme activity was defined as 
the amount of enzyme that releases 1 ^imol phosphate per minute at 37 °C. The protein 
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concentration was determined using the enzyme extinction coefficient at 280 nm calculated 
according to Pace et al (1995): consensus phytase, 1.101; consensus phytase 7, 1.068; 
consensus phytase 10, 1.039. 

In case of pH-optimum curves, purified enzymes were diluted in 10 mM sodium acetate, 
5 pH 5.0. Incubations were started by mixing aliquots of the diluted protein with an equal 
volume of 1% phytic acid («10 mM) in a series of different buffers: 0.4 M glycine/HCl, pH 
2.5; 0.4 M acetate/NaOH, pH 3.0, 3.5, 4.0, 4.5, 5.0, 5.5; 0.4 M imidazoIe/HCl, pH 6.0, 6.5; 
0.4 M Tris/HCl pH 7.0, 7.5, 8.0, 8.5, 9.0. Control experiments showed that pH was only 
slightly affected by the mixing step. Incubations were performed for 15 min at 37 °C as 
10 described above. 

For determinations of the substrate specificities of the phytases, phytic acid in the 
assay mixture was replaced by 5 mM concentrations of the respective phosphate 
compounds. The activity tests were performed as described above. 

For determination of the temperature optimum, enzyme (100 and substrate 
15 solution (100 nl) were pre-incubated for 5 min at the given temperature. The reaction was 
started by addition of the substrate solution to the enzyme. After 15 min incubation, the 
reaction was stopped with trichloroacetic acid and the amount of phosphate released was 
determined. 

The pH-optimum of the original consensus phytase was around pH 6.0-6.5 (70 
20 U/mg). By introduction of the Q50T mutation, the pH-optimum shifted to pH 6.0 (1 30 
U/mg). After introduction of K91A, the pH optimum shifted one pH-unit into the acidic 
pH-range showing a higher specific activity between pH 2.5 and pH 6.0. That was shown 
for the stabilized mutants and for consensus phytase- 10, too (Figure 14 and 15). 

Consensus phytase-7, which was constructed to transfer the catalytic characteristics of the 
25 A. niger phytase NRRL 3 1 35 into the consensus phytase, had a pH-profile which is shifted 
into the acidic range of the pH-spectrum showing an optimum between pH 4.5 and 5.0 (see 
Figure 19). The enzyme had a broad pH-optimum reaching at least 60% of its increased 
maximum activity from pH 2.5 to pH 6.0. The substrate spectrum, too, resemble more to 
that of the A. niger NRRL 3135 phytase than to the consensus phytase- 1 . 

30 The temperature optimum of consensus phytase-1 (71 °C) was 16-26 °C higher than 

the temperature optimum of the wild-type phytases (45-55 °C, Table 7) which were used to 
calculate the consensus sequence. The improved consensus phytase- 10 showed a further 
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increase of its temperature optimum to 80 °C (Figure 1 1). The temperature optimum of the 
consensus phytase-l-thermo[8] was found in the same range (78 °C) using the supernatant 
of an overproducing S. cerevisiae strain. The highest temperature optimum reached of 82 ° 
C was determined for consensus phytase-10-thermo-Q50T-K91 A. 

Table 7: Temperature optimum and r m -value of consensus phytase and of the phytases 
from A.fumigatus, A. niger, E. nidulans, and M. thermophila. The determination of the 
temperature optimum was performed as described in Example 9. The T^-values were 
determined by differential scanning calorimetry as described in Example 10. 



phytase 


temperature 
optimum [°C] 


7m 
[°C] 


Consensus phytase- 10- 


82 


89.3 


Consensus phytase- 10- 
tnernio-v^ju i 


82 


88.6 


i^onbenbUo pxiyiaoc-iv 


80 


85.4 


consensus pnyuis>e-i- 
thermo[8]-Q50T 


78 


84.7 


Consensus phytase- 1- 
thermo[8]-Q50T-K91A 


78 


85.7 


Consensus phytase- 1 


71 


78.1 


A. wgerNRRL3135 


55 


63.3 


A.fumigatus 13073 


55 


62.5 


A.fumigatus 13073 
a-mutant 


60 


67.0 


A.fumigatus 13073 
a-mutant (optimized) 


63 




A. terreus 9A-1 


49 


57.5 


A. terreus cbs.l 16.46 


45 


58.5 


E. nidulans 


45 


55.7 


M. thermophila 


55 


n. d. 


T. thermophilus 


45 


n. d. 
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Example 1Q 

Determination of the melting point by differential scanning calorimgtrv IPS O 

In order to determine the unfolding temperature of the phytases, differential scanning 
5 calorimetry was applied as previously published by Brugger et al (1997). Solutions of 50- 
60 mg/ml homogeneous phytase were used for the tests. A constant heating rate of 1 0 ° 
C/min was applied up to 90-95 °C 

The determined melting points reflect the results obtained for the temperature 
optimums (Table 7). The most stable consensus phytase designed is consensus phytase-10- 
10 thermo-Q50T-K91 A showing a melting temperature under the choosen condition of 89.3 ° 
C. This is 26 to 33.6 °C higher than the melting point of the wild-type phytases used. 

Example 1 1 

Transfer of frasidiomvcete phvtase active site into consensus phvtase-10-thermo- 
Q50T-K91A 

15 As described previously (Example 3), mutations derived from the basidiomycete 

phytase active site were introduced into the consensus phytase 10. The following five 
constructs a) to e) were prepared: 

a) This construct is called consensus phytase 12, and it comprises a selected number of 
active site residues of the basidio consensus sequence, its amino acid sequence 

20 (consphyl2) is shown in Fig. 21 (the first 26 amino acids forms the signal peptide, 
amended positions are underlined); 

b) a cluster of mutations (Cluster II) was transferred to the consensus 10 sequence, viz.: 
S80Q, Y86F, S90G, K91 A, S92A, K93T, A94R, Y95I; 

c) analogously, another cluster of mutations (Cluster III) was transferred, viz.: T129V, 
25 E133A, Q143N, M136S, V137S, N138Q, S139A; 

d) analogously, a further cluster of mutations (Cluster IV) was transferred, viz.: A168D, 
E171T, K172N,F173W; 
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e) and finally, a further cluster of mutations (Cluster V) was transferred, viz.: Q297G, 
S298D, G300D, Y305T. 

These constructs were expressed as described in Examples 6 to 8. 
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Claims 

1. A process for the preparation of a consensus protein, whereby such process is 
characterized by the following steps: 

a) at least three, preferably four amino acid sequences are aligned by any standard 
alignment program known in the art, 

b) amino acids at the same position according to such alignment are compared 
regarding their evolutionary similarity by any standard program known in the 
art, whereas the degree of similarity provided by such a program which defines 
the least similarity of the amino acids that is used for the determination of an 
amino acid of corresponding positions is set to a less stringent number and the 
parameters are set in such a way that it is possible for the program to determine 
from only 2 identical amino acids at a corresponding position an amino acid for 
the consensus protein; however, if among the compared amino acid sequences 
are sequences that show a much higher degree of similarity to each other than 
to the residual sequences, these sequences are represented by their consensus 
sequence determined as defined in the same way as in the present process for 
the consensus sequence of the consensus protein or a vote weight of 1 divided 
by the number of such sequences is assigned to every of -those sequences, 

c) in case no common amino acid at a defined position is identified by the program, 
any of the amino acids, preferably the most frequent amino acid of all such 
sequences is selected, 

d) once the consensus sequence has been defined, such sequence is back-translated 
into a DNA sequence, preferably by using a codon frequency table of the 
organism in which expression should take place, 

e) the DNA sequence is synthesized by methods known in the art and used either 
integrated into a suitable expression vector or by itself to transform an 
appropriate host cell, 

f) the transformed host cell is grown under suitable culture conditions and the 
consensus protein is isolated from the host cell or its culture medium by methods 
known in the art. 

2. A process as claimed in claim 1 wherein the program used for the comparison of 
amino acids at a defined position regarding their evolutionary similarity is the program 
"PRETTY". 
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3. A process as claimed in claims 1 or 2, wherein 

in a first step a consensus sequence is determined from a number of highly homologous 
sequences according to steps a), b) and c) of claim 1, 

in a second step the amino acid sequence of another protein which is homologous to the 
consensus sequence is compared with the consensus sequence and 

in a third step only those amino acid residues are replaced in the amino acid sequence of 
the other protein which clearly differ from the consensus sequence of this protein family 
calculated under moderately stringent conditions whereas at all positions of the alignment 
where no preferred single amino acid can be determined under moderately stringent 
conditions the amino acids of the other protein remain unchanged. 

4. A process as claimed in any one of claims 1-3, wherein 

in a first step a consensus sequence is determined from homologous sequences according 
to steps a), b) and c) of claim 1, 

in a second step the active center of the protein comprising all amino acid residues that are 
involved in forming the active center is determined in the consensus sequence and in the 
sequence of a homologous protein as well and 

in a third step some or all of the amino acids that form the active center of the 
homologeous protein are inserted in the backbone of the consensus sequence. 

5. A process as claimed in claim 4, wherein the active center of the protein is 
determined by using an analysis of the three-dimensional structure of the protein. 

6. A process as claimed in claims 4 and 5, wherein the homologeous protein is an 
enzyme. 

7. A process as claimed in claims 1 to 6, wherein the defined protein family is the 
family of phytases. 

8. A process as claimed in claim 7, wherein the phytases are of fungal origin. 

9. A process as claimed in claims 7 or 8, wherein the amino acid sequence of the 
phytase is changed by the introduction of at least one mutation selected from the group 
consisting of 
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E58A 


F54Y 


D69K 


I73V 


D197N 


K94A 


T214L 


R101A 


E222T 


N153K 


E267D 


VI 581 


R29II 


A203G 


R329H 


S205G 


S364T 


V217A 


A379K 


A227V 


G404A 


V234L 




P238A 




Q277E 




A287H 




A292Q 




V366I 




A396S 




E415Q 




G437A 




E451R 



whereby the number represents the position in the consensus phytase sequence or a 
corresponding residue according to an alignment as shown in Fig. 1 when 26 amino acids 
(signal sequence) are added to the sequences shown in Fig. 1 and the letter before the 
number represents the amimo acid in the phytase which is replaced by the amino acid 
behind the number. 

10. A process as claimed in any one of claims 1 to 9, wherein the host cell is of 
eukaryotic origin. 
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11. A process as claimed in claim 10, wherein eukaryotic means fungal, preferably 
Aspergillus or yeast, preferably Saccharomyces or Hansenula. 

12. A consensus protein obtainable preferably obtained by a process as claimed in 
any one of claims 1 to 1 1 . 

13. A consensus protein which comprises the amino acid sequence shown in Figure 2 
or any variants or muteins thereof (consensus phytase-1). 

14. A mutein of the consensus protein of claim 13 characterized therein that in the 
amino acid sequence of Figure 2 the following replacements have been effected Q50L, 
Q50T, Q50G, Q50T-Y51N, Q50L-Y51N or Q50T-K91A. 

15. A consensus protein which comprises the amino acid sequence shown in Figure 4 
having the designation consensus phytase 10 (FcplO) and any variants or muteins thereof. 

16. A consensus protein which comprises the amino acid sequence shown in Figure 6 
having the designation Consensus seq. 1 1 and any variant or mutein thereof. 

17. A consensus protein which comprises the amino acid sequence shown in Figure 
10 (consensus phytase 7) and any variant or mutein thereof. 

1 8. A consensus protein which comprises the amino acid sequence shown in Figure 
21 (consensus phytase 12) and any variant or mutein thereof. 

19. A consensus protein which comprises the amino acid sequence shown in Figure 3 
(basidio consensus) and any variant or mutein thereof. 

20. A phytase being selected from amongst: A. fumigatus ATCC 13073 alpha- 
mutant; A. fumigatus ATCC 13073 alpha-mutant-(E59A-S126N-R329H, S364T-G404A)- 
Q27T; A. fumigatus ATCC 13073 alpha-mutant-(E59A, S126N-R329H-S364T-G404A)- 
Q27T-K68A, preferably the latter. 

21. A food, feed or pharmaceutical composition comprising a consensus protein as 
claimed in any of the claims 12 to 17. 



Figure 1 
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Modtaget PD 
22 JAN. 1999 



50 

5 A. terreus 9A-1 KhsDCNSVDh GYQCFPELSH kWGlYAPYFS 

LQDESPFP1D VPEDChlTFV 

A. terreus cbs NhsDCTSVDr GYQCFPELSH kWGlYAPYFS 

LQDESPFP1D VPDDChlTFV 

A. niger var. awamori Nqs TCDTVDQ GYQCFSETSH LWGQYAPFFS 
10 LANES AISPD VPAGCrVTFA 

A. niger T213 NqsSCDTVDQ GYQCFSETSH LWGQYAPFFS 

LANESVISPD VPAGCrVTFA 

A. niger NRRL3135 NqsSCDTVDQ GYQCFSETSH LWGQYAPFFS 

LANESVISPE VPAGCrVTFA 
15 A. fumigatus 13073 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS 
LEDE1SVSSK LPKDCrlTLV 

A. fumigatus 32722 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS 
LEDE1SVSSK LPKDCrlTLV 

A. fumigatus 58128 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS 
20 LEDE1SVSSK LPKDCrlTLV 

A. fumigatus 26906 GSkSCDTVDl GYQCsPATSH LWGQYSPFFS 
LEDElSVSSK LPKDCrlTLV 

A. fumigatus 32239 GSkACDTVEl GYQCsPGTSH LWGQYSPFFS 
LEDE1SVSSD LPKDCrVTFV 
25 E. nidulans QNHS CNTADG GYQCFPNVSH VWGQYSPYFS 

IEQESAISeD VPHGCeVTFV 

T. thermophilus DSHSCNTVEG GYQCrPEISH sWGQYSPFFS 

LADQSEISPD VPQNCklTFV 

M. thermophila ESRPCDTpDl GFQCgTAISH FWGQYSPYFS 

30 VpSElDaS. . I PDDCe VTFA 

Consensus NSHS CDTVDG GYQCFPEISH LWGQYSPYFS 

LEDESAISPD VPDDC-VTFV 

Consensus pbytase NSHSCDTVDG GYQCFPEISH LWGQYSPYFS 

35 LEDESAISPD VPDDCRVTPV 



100 

40 A. terreus 9A-1 QVLARHGARs PThSKtKAYA AtlAAIQKSA 

Ta FpGKY AFL QSYNYSLDSE 

A. terreus cbs QVLARHGARs PTDSKt KAY A AtlAAIQKNA 

TaLpGKYAFL KSYNYSMGSE 

A. niger var. awamori QVLSRHGARY PTE S Kg Kk Y S ALIEEIQQNV 
45 TtFDGKYAFL KTYNYSLGAD 

A. niger T213 QVLSRHGARY PTESKgKkYS ALIEEIQQNV 

TtFDGKYAFL KTYNYSLGAD 

A. niger NRRL3135 QVLSRHGARY PTDSKgKkYS ALIEEIQQNA 

TtFDGKYAFL KTYNYSLGAD 
50 A . fumigatus 13073 QVLSRHGARY PTSSKsKkYK kLVTAlQaNA 
TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32722 QVLSRHGARY PTSSKsKkYK kLVTAlQaNA 

TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 58128 QVLSRHGARY PTSSKsKkYK kLVTAlQaNA 
55 TdFKGKFAFL KTYNYTLGAD 
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A. fumigatus 2 6906 QVLSRHGARY PTSSKsKkYK kLVTAIQaNA 
TdFKGKFAFL KTYNYTLGAD 

A. fumigatus 32239 QVLSRHGARY PTASKsKkYK kLVTAIQKNA 
TeFKGKFAFL ETYNYTLGAD 
5 E. nidulans QVLSRHGARY PTESKsKAYS GLIEAIQKNA 

TsFwGQYAFL ESYNYTLGAD 

T. thermophilus QLLSRHGARY PTSSKtElYS QLISrIQKTA 

TaYKGyYAFL KDYrYqLGAN 

M. thermophila QVLSRHGARa PTIKRaaSYv DLIDrlHhGA 

10 IsYgPgYEFL RTYDYTLGAD 

Consensus QVLSRHGARY PTSSK-KAYS ALIEAIQKNA T- 

FKGKYAFI* KTYNYTLGAD 

Consensus phytase QVLSRHGARY PTSSKSKAYS ALIEAIQKNA 

15 TAFKGKYAFL KTYNYTLGAD 
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150 

20 A. terreus 9A-1 ELTPFGrNQL rDlGaQFYeR YNALTRhlnP 

FVRATD AS R V hESAEKFVEG 

A. terreus cbs NLTPFGrNQL qDlGaQFYRR YDTLTRhlnP 

FVRAADSSRV hESAEKFVEG 

A . niger var. awamori DLTPFGEQEL VNSGIKFYQR YESLTRNIIP 
25 FIRSSGSSRV IASGEKFIEG 

A . niger T213 DLTPFGEQEL VNSGIKFYQR YESLTRNIIP 

FIRSSGSSRV IASGEKFIEG 

A. niger NRRL3135 DLTPFGEQEL VNSGIKFYQR YESLTRNIVP 

FIRSSGSSRV IASGKKFIEG 
30 A, fumigatus 13 073 DLTPFGEQQL VNSGIKFYQR YKALARSWP 
FIRASGSDRV IASGEKFIEG 

A. fumigatus 32 722 DLTPFGEQQL VNSGIKFYQR YKALARSWP 

FIRASGSDRV IASGEKFIEG 

A. fumigatus 5812 8 DLTPFGEQQL VNSGIKFYQR YKALARSWP 
35 FIRASGSDRV IASGEKFIEG 

A . fumigatus 26906 DLTAFGEQQL VNSGIKFYQR YKALARSWP 
FIRASGSDRV IASGEKFIEG 

A . fumigatus 3223 9 DLTPFGEQQM VNSGIKFYQK YKALAgSWP 
FIRSSGSDRV IASGEKFIEG 
40 E. nidulans DLTi FGENQM VDSGaKFYRR YKNLARKnTP 

FIRASGSDRV VASAEKFING 

T. thermophilus DLTPFGENQM IQlGIKFYnH YKSLARNaVP 

FVRCSGSDRV IASGrlFIEG 

M. thermophila ELTRtGQQQM VNSG I KFYRR YRALARKsIP 

45 FVRTAGqDRV VhSAENFTQG 

Consensus DLTPFGENQM VNSG I KFYRR YKALARK-VP 

FVRASGSDRV IASAEKFIEG 

Consensus phytase DLTPFGENQM VNSGI KFYRR YKALARKIVP 

50 FIRASGSDRV IASAEKFIEG 
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151 

200 

A. terreus 9A-1 FQTARqDDHh ANpHQPSPrV DValPEGSAY 

NNTLEHS1CT AFES . . .STV 
5 A. terreus cbs PQNARqGDPh ANpHQPSPrV DWIPEGTAY 

NNTLEHSICT AFEA. . . STV 

A. niger var. awamori FQSTKLkDPr AqpgQSSPkl DWISEASSs 
NNTLDPGTCT VFED . . . SEL 

A. niger T213 FQSTKLkDPr AqpgQSSPkl DWISEASSs 

10 NNTLDPGTCT VFED. . .SEL 

A. niger NRRL3135 FQSTKLkDPr AqpgQSSPkl DWISEASSs 

NNTLDPGTCT VFED . . . SEL 

A. fumigatus 13 073 FQqAKLADPG A.TNRAAPAI SVIIPESETF 

NNTLDHGVCT kFEA. . . SQL 
15 A. fumigatus 32722 FQqAKLADPG A.TNRAAPAI SVIIPESETF 
NNTLDHGVCT kFEA. . .SQL 

A. fumigatus 58128 FQqAKLADPG A.TNRAAPAI SVIIPESETF 

NNTLDHGVCT kFEA. . . SQL 

A. fumigatus 26906 FQqAKLADPG A.TNRAAPAI SVIIPESETF 

20 NNTLDHGVCT kFEA. . . SQL 

A. fumigatus 32239 FQqANVADPG A . TNRAAPVI SVIIPESETY 

NNTLDHSVCT NFEA. . .SEL 

E. nidulans FRKAQLhDHG S..gQATPW NVIIPEiDGF 

NNTLDHS TCV SFEN. . . DEr 
25 T. thermophilus FQSAKVlDPh SDkHDAPPTI NVIIeEGPSY 

NNTLDtGSCP VFED. . . SSg 

M. thermophila FHSA1LADRG STvRPTlPyd mWI PETAGa 

NNTLHND1CT AFEEgpySTI 

30 Consensus FQSAKLADPG S-PHQASPVI NVIIPEGSGY 

NNTLDHGTCT AFED SEL 

Consensus phytaso FQSAKLADPG SQPHQASPVX DVIIPEGSGY 

NNTLDHGTCT AFED. . . SEL 

35 

201 

250 

A. terreus 9A-1 GDDAvANFTA VFAPAIaQRL EADLPGVqLS 

TDDWnLMAM CPFETVS1TD 
40 A. terreus cbs GDAAADNFTA VFAPAIakRL EADLPGVqLS 

ADDWnLMAM CPFETVSlTD 

A. niger var. awamori ADTVEANFTA TFAPSIRQRL ENDLSGVTLT 
DTEVTyLMDM CSFDTIStST 

A. niger T213 ADTVEANFTA TFAPSIRQRL ENDLSGVTLT 

45 DTEVTyLMDM CSFDTIStST 

A. niger NRRL3135 ADTVEANFTA TFVPSIRQRL ENDLSGVTLT 

DTEVTyLMDM CSFDTIStST 

A. fumigatus 13073 GDEVAANFTA 1 FAPDIRARa EkHLPGVTLT 

DEDWs LMDM CSFDTVARTS 
50 A. fumigatus 32722 GDEVAANFTA 1 FAPDIRARa EkHLPGVTLT 

DEDWsLMDM CSFDTVARTS 

A. fumigatus 5 812 8 GDEVAANFTA 1 FAPDIRARa EkHLPGVTLT 
DEDWs LMDM CSFDTVARTS 

A. fumigatus 26906 GDEVAANFTA 1 FAPDIRARa KkHL PGVTLT 
55 DEDWs LMDM CSFDTVARTS 

A. fumigatus 32239 GDEVEANFTA 1 FAPAIRARI EkHLPGVqLT 
DDDWs LMDM CSFDTVARTA 
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E. nidulans ADEiEANFTA IMGPPIRJcRL ENDLPGIKLT 

NENVIyLMDM CSFDTMARTA 

T. thermophilus GHDAQEKFAk qFAPAIlEKI KDHLPGVDLA 

vSDVpyLMDL CPFETLARNh 
5 W. thermophila GDDAQDTY1S TFAGPItARV NANLPGANLT 

DADTVaLMDL CPFETVAsSS 

Consensus GDDAEANFTA TFAPAIRARL EADLPGVTLT DEDW- 

LMDM CPFETVARTS 
10 Consensus phytase GDDVEANFTA LFAPAIRARL EADLPGVTLT 

DBDWYLHDH CPFETVARTS 



251 

15 300 

A. terreus 9A-1 DAhTLSPFC DLFTAtEWtq 

YNYL1SLDKY YGYGGGNPLG 

A . terreus cbs DAhTLSPFC DLFTAaEWtq 

YNYLlSLDKY YGYGGGNPLG 

20 A. niger var. awamori vDTKLSPFC DLFTHdEWih 

YDYLQSLkKY YGHGAGNPLG 

A. niger T213 vDTKLSPFC DLFTHdEWih 

YDYLRSLkKY YGHGAGNPLG 

A. niger NRRL3135 vDTKLSPFC DLFTHdEWin 

25 YDYLQSLkKY YGHGAGNPLG 

A. fumigatUS 13073 DASQLSPFC QLFTHnEWkk 

YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 32722 DASQLSPFC QLFTHnEWkk 

YNYLQSLGKY YGYGAGNPLG 

30 A. fumigatus 58128 DASQLSPFC QLFTHnEWkk 

YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 26906 DASQLSPFC QLFTHnEWkk 

YNYLQSLGKY YGYGAGNPLG 

A. fumigatus 32239 DASELSPFC AIFTHnEWkk 

35 YDYLQSLGKY YGYGAGNPLG 

E. nidulans HGTELSPFC AIFTEkEWlq 

YDYLQSLSKY YGYGAGSPLG 

T. thermophilus TDT.LSPFC ALsTQeEWqa 

YDYYQSLGKY YGnGGGNPLG 
40 M. thermophila sdpatadagg gNGrpLSPFC rLFSEsEWra 

YDYLQSVGKW YGYGPGNPLG 

Consensus -DATELSPFC ALFTE-EW- - 

YDYLQSLGKY YGYGAGNPLG 

45 Consensus phytase DATELSPFC ALFTHDEWRQ 

YDYLQSLGKY YGYGAGNPLG 
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350 

A. terreus 9A-1 PVQGVGWaNE LMARLTRAPV HDHTCVNNTL 

DASPATFPLN ATLYADFSHD 
5 A. terreus Cbs PVQGVGWaNE LIARLTRSPV HDHTCVNNTL 

DANPATFPLN ATLYADFSHD 

A. niger var. awamori PTQGVGYaNE LIARLTHSPV HDDTS SNHTL 
DSNPATFPLN STLYADFSHD 

A. niger T213 PTQGVGYaNE LIARLTHSPV HDDTS SNHTL 

10 DSNPATFPLN STLYADFSHD 

A. niger NRRL3135 PTQGVGYaNE LIARLTHSPV HDDTS SNHTL 

DSSPATFPLN STLYADFSHD 

A. fumigatus 13073 PAQGIGFtNE LIARLTRSPV QDHTSTNsTL 
vSNPATFPLN ATMYVDFSHD 
15 A. fumigatus 32722 PAQGIGFtNE LIARLTRSPV QDHTSTNsTL 

vSNPATFPLN ATMYVDFSHD 

A. fumigatus 58128 PAQGIGFtNE LIARLTRSPV QDHTSTNsTL 
VSNPATFPLN ATMYVDFSHD 

A. fumigatus 26906 PAQGIGFtNE LIARLTRSPV QDHTSTNsTL 
20 vSNPATFPLN ATMYVDFSHD 

A. fumigatus 32239 PAQGIGFtNE LIARLTNSPV QDHTSTNsTL 
DSDPATFPLN ATIYVDFSHD 

E. nidulans PAQGIGFtNE LIARLTQSPV QDNTS TNHTL 

DSNPATFPLD rKLYADFSHD 
25 T. thermophilus PAQGVGFvNE LIARMTHSPV QDYTTVNHTL 

DSNPATFPLN ATLYADFSHD 

M. thermophila PTQGVGFvNE LLARLAgvPV RDgTSTNRTL 

DGDPrTFPLG rPLYADFSHD 

30 Consensus PAQGVGF-NE LIARLTHSPV QDHTSTNHTL 

DSNPATFPLN ATLYADFSHD 

Consensus phytase PAQGVGFANE LIARLTRSPV QDHTSTNHTL 

DSNPATFPLN ATLYADFSHD 

35 

351 

400 

A. terreus 9A-1 SNLVSIFWAL GLYNGTAPLS qTSVESVSQT 

DGYAAAWTVP FAARAYVEMM 
40 A. terreus cbs SNLVSIFWAL GLYNGTkPLS qTTVEDITrT 

DGYAAAWTVP FAARAYI EMM 

A. niger var. awamori NGIISILFAL GLYNGTkPLS TTTVENITQT 
DGFSSAWTVP FASRlYVEMM 

A. niger T213 NGIISILFAL GLYNGTkPLS TTTVENITQT 

45 DGFSSAWTVP FASRlYVEMM 

A. niger NRRL313 5 NGIISILFAL GLYNGTkPLS TTTVENITQT 

DGFSSAWTVP FASRlYVEMM 

A. fumigatus 13073 NSMVSIFFAL GLYNGTEPLS rTSVESaKEl 
DGYSASWWP FGARAYFEtM 
50 A. fumigatus 32722 NSMVSIFFAL GLYNGTGPLS rTSVESaKEl 
DGYSASWWP FGARAYFEtM 

A. fumigatus 5B128 NSMVSIFFAL GLYNGTEPLS rTSVESaKEl 
DGYSASWWP FGARAYFEtM 

A. fumigatus 26906 NSMVSIFFAL GLYNGTEPLS rTSVESaKEl 
55 DGYSASWWP FGARAYFEtM 

A. fumigatus 32239 NGMIPIFFAM GLYNGTEPLS qTSeESTKES 
NGYSASWAVP FGARAYFEtM 
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E. nidulans NSMISIFFAM GLYNGTQPLS mDSVESIQEm 

DGYAASWTVP FGARAYFELM 

T. thermophilus NTMTSIFaAL GLYNGTAkLS TTEIKSIEET 

DGYSAAWTVP FGGRAYIEMM 

AT. thermophila NDMMGVLgAL GaYDGVPPLD KTArrDpEEl 

GGYAASWAVP FAARiYVEKM 

Consensus NSMISIFFAL GLYNGTAPLS TTSVESIEET 

DGYAASWTVP FGARAYVEMM 

Consensus phytase NSMISIFFAL GLYNGTAPLS TTSVESIEET 

DGYSASWTVP FGARAYVEMM 



401 

450 

A. terreus 9A-1 QC RAEKE PLVRVLVNDR 

VMPLHGCPTD KLGRCKrDAF 

A. terreus cbs QC RAEKQ PLVRVLVNDR 

VMPLHGCAVD NLGRCKrDDF 

A. niger var. awamori QC QAEQE PLVRVLVNDR 

WPLHGCPID aLGRCTrDS F 

A. niger T213 QC QAEQE PLVRVLVNDR 

WPLHGCPID aLGRCTrDS F 

A. niger NRRL3135 QC QAEQE PLVRVLVNDR 

WPLHGCPVD aLGRCTrDS F 

A. fumigatus 13073 QC KSEKE PLVRAL INDR 

WPLHGCDVD KLGRCKLNDF 

A. fumigatus 32722 QC KSEKE PLVRAL INDR 

WPLHGCDVD KLGRCKLNDF 

A. fumigatus 58128 QC KSEKE SLVRALINDR 

WPLHGCDVD KLGRCKLNDF 

A. fumigatus 26906 QC KSEKE PLVRAL INDR 

WPLHGCDVD KLGRCKLNDF 

A. fumigatus 32239 QC KSEKE PLVRAL INDR 

WPLHGCAVD KLGRCKLKDF 

E. nidulans QC E.KKE PLVRVLVNDR 

WPLHGCAVD KFGRCTLDDW 

T. thermophilus QC DDSDE PWRVLVNDR 

WPLHGCEVD SLGRCKrDDF 

M. thermophila RCsggggggg ggegrQEKDE eMVRVLVNDR 

VMTLkGCGAD ErGMCTLErF 

Consensus QC QAEKE PLVRVLVNDR 

WPLHGCAVD KLGRCKLDDF 

Consensus phytase QC QAEKE PLVRVLVNDR 

WPLHGCAVD KLGRCKRDDF 



471 

A. terreus 9A-1 
A. terreus cbs 

GNWAECF 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

GDWAECFA — - 

A. fumigatus 13073 

A. fumigatus 32722 



451 

VAGL S FAQAG GNWADCF-- 
VEGLSFARAG 

VrGLSFARSG GDWAECsA- 
VrGLSFARSG GDWAECFA - 
VrGLSFARSG 

VKGLSWARSG GNWGECFS- 
VKGLSWARSG GNWGECFS - 
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A. fumigatus 58128 
A. fumigatus 26906 
A. fumigatus 32239 
GNSEQSFS — - 
5 B. nidulans 

T. thermophilus 
M. thermophila 



VKGLSWARSG GNWGECFS — - 

VKGLSWARSG GNWGECFS — - 
VKGLSWARSG 

VEGLNFARSG GNWkTCFTl - - 

VrGLSFARqG GNWEGCYAas e 

I E SMAFARGN GKWD1CFA — - 



Consensus 
10 Consensus phytase 



VEGLSFARSG GNWAECFA-- 
VEGLSFARSG GNWAECFA. . 



Figure 2 

CP-l 

Eco RIMGVFVVLLS IATLFGST 
15 TATATGAATrC&XfiGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

X + + + + + + 60 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTALGPRGNSHSCDTVDGG 
20 CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

61 + + + + + + 120 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 
CP-2 

CP-3 

25 YQCFPEISHIiWGQYSPYFSL 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATACTTCTCTT 

12 l + --- + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 

30 EDESAISPDVPDDCRVTFVQ 
TGGAAGACGAATCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 

181 + + + + + + 240 

ACCTTCTGCTTAGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 

CP-4 

35 CP-5 

VLSRHGARYPTSSKSKAYSA 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTTCTTCTAAGTCTAAGGCTTACTCTG 
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241 + + + + + + 3Q0 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCCGAATGAGAC 

LIEAIQKNATAFKGKYAFLK 

CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + -__ + + + 26Q 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7 

TYNYTLGADDLTPFGENQMV 

AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 

361 + + + + + + 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

NSGIKFYRRYKALARKIVPF 
TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 



421 



-+ 480 



AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-8 

CP-9 

I R ASGSDRVIASAEKF I EG F 
T CATT AG AG CTTCTGGTTCTG ACAG AG TTATTGCTTCTG CTGAAAAGTTCATTGAAGGTT 
481 + + + + + + 540 

AGTAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 



QSAKLADPGSQPHQAS PVID 
TCCAATCTGCTAAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG 

541 + + + + + + 6Q0 

AGGTTAGACGATTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAAC 

CP-10 

CP-11 



VI IPEGSGYNNTLDHGTCTA 
ACGTTATTATTCCAGAAGGaTCcGGTTACAACAACACTTTGGACCACGGTACTTGTACTG 
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TGCAATAATAAGGTCTTCCtAGgCCAATGTTGTTGTGAAACCTGGTGCCATGAACATGAC 

FEDSBLGDDVEANFTALFAP 
CTTTCGAAGACTCTGAATXGGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTC 

661 + --- - + + + + + 720 

GAAAGCTTCTGAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAG 

CP-12 

AIRARLEADLPGVTLTDEDV 
CAGCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACG 

721 + + + + + + 780 

GTCGATAATCTCGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGC 

CP-13 

VYLMDMCPFETVARTSDATE 
TTGTTTACTTGATGGACATGTGTCCATTCGAAACTGTTGCTAGAACTTCTGACGCTACTG 

781 + + + + + + 840 

AACAAATGAACTACCTGTACACAGGTAAGCTTTGACAACGATCTTGAAGACTGCGATGAC 

LSPFCALFTHDEWRQYDYLQ 
AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGACAATACGACTACTTGC 

841 + + + + + + 900 

TTAACAGAGGTAAGACACGAAACAAGTGAGTGCTGCTTACCTCTGTTATGCTGATGAACG 
CP-14 

CP-15 

SLGKYYGYGAGNPLGPAQGV 
AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTG 

901 + + + + + + 960 

TTAGAAACCCATTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 

GFANEL IARLTRSPVQDHTS 
TTGGTTTCGCTAACGAATTGATTGCTAGATTGACTAGATCTCCAGTTCAAGACCACACTT 



Modtaget PD 
2 2 JAN, 1999 



-58 

ssi + + 



1020 

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 

CP-16 

CP-17 

TNHTLDSNPATFPLNATLYA 
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 

1021 + + + + + + 

1080 

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNSMISIFFALGIiYNG 
CTGACTTCTCTCACGACAACTCTATGATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 
1081 + + + + + + 

1140 

GACTGAAGAGAGTGCTGTTGAGATACTAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18 

CP-19 

TAPLSTTSVES IEETDGYSA 
GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTG 
1141 + + + + + + 

1200 

CATGACGAGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGAC 

SWTVPFGARAYVEMMQCQAE 

CTTCTTGGACTGTTCCATTCGGTGCTAGAGCTTACGTTGAAATGATGCAATGTCAAGCTG 

1201 + + + + + + 

1260 

GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACAGTTCGAC 

CP-20 

CP-21 

KEPLVRVLVNDRVVPtiHGCA 
AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 
1261 + + + + + + 

1320 
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TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAOGTAACGTGCCAACAC 

VDKLGRCKRDDFVEGLSFAR 
CTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 

5 1321 + + + + + + 

1380 

GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22 

SGGNWAECFA* ECO RI 
10 GATCTGGTGGTAACTGGGCTGAATGTTTCGCT TAAGAATTCATATA 

1381 + — - + + + 1426 

CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 



15 

Figure 3 

i 

50 

P. involutus {phyAD SvP. KnTAPt FPIPeseQrn WSPYSPYFPL AeYkAPPAGC 
20 QlnQVNIIQR 

P. involutus (phyA2) SvP. RniAPK FSIPeseQrn WSPYSPYFPL AeYkAPPAGC 
EInQVNIIQR 

T. pubescens hiPlRdTSAc LdVTrDvQqs WSmYSPYFPa' AtYvAPPASC 

QlnQVHIIQR 

25 A . pediades GgwQaTfvQ pfFPpQiQds WAAYTPYYPV qaYtPPPkDC 

KItQVNTIQR 

P. lycii StQf sfvAAQ LPIPaQntsn WGPYdPFFPV EpYaAPPEGC 

tVtQVNLIQR 



30 Basidio 
QVNIIQR 



S-P-R-TAAQ LPIP-Q-Q- - WSPYSPYFPV A-Y-APPAGC QI- 
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51 



100 



P. involutus (phyAl) HGARFPTSGA TTR IKAGLTK LQGvqnfTDA KFNPlkSfkY 
dLGn sDLVPF 

5 P. involutus [phyA2) HGARFPTSGA ATRIKAGLSK LQSvqnfTDP KFDFIkSfTY 
dLGtsDLVPF 



T. pubescens 
sLGqDsLVeL 

A. pediades 
10 t LGhDDLVP F 

P. lycii 
kFGvADLLPF 



HGARFPTSGA AkRIQTAVAK LKAAs nyTDP lLAFVtNyTY 
HGARFPTSGA GTRIQAAVkK LQSAktyTDP RLDFLtNyTY 
HGARWPTSGA rSRqvAAVAK IQmArpf TDP KYEFLnDfvY 



Basldio 
15 DDLVPF 



HGARFPTSGA ATR I Q AAV AK LQSA TDP KLDFL-N-TY -LG- 



101 



150 



20 P. involutus (phyAl) GAaQSfDAGQ EAFARYSkLV SkNNLPFIRA dGSDRWDSA 
TNWTAGFAsA 

P. involutus {phyA2) GAaQSfDAGl EvFARYSkLV SsDNLPFIRS dGSDRWDTA 
TNWTAGFAsA 



T. pubescens 
25 nNWTAGFA 1 A 

A. pediades 
TNWTEG F S a A 

P. lycii 
TNWTAGFGdA 



30 



Basldio 
TNWTAGFA-A 



GAtQSSEAGQ EAFTRYSsLV SaDELPFVRA SGSDRWATA 
GA1QSSQAGE ETFqRYSfLV SkENLPFVRA SSSNRWDSA 
GAnQShQTGt DmYTRYStLf egGDVPFVRA AGdQRWDS S 

GA-QSSQAGQ BAFTRYS-LV S-DNLPFVRA SGSDRWDSA 



35 



200 



151 



P. involutus {phyAl) ShNTVqPkLn LILPQtGNDT LEDNMCPaAG DSDPQvNaWL 
AVaf PSITAR 

P. involutus {phyA2) SrNAiqPkLd LILPQtGNDT LEDNMCPaAG ESDPQvDaWL 
40 Asaf PSVTAQ 



T. pubescens 
AqFAPPMTAR 

A. pediades 
SIYGTPIAnR 

5 P. lycii 
GVFAPnITAR 



-61- 

SsNSitPvLs VIISEaGNDT LDDNMCPaAG DSDPQvNqWL 



ShHvlnPiLf VILSEslNDT LDDaMCPnAG sSDPQtGiWt 



SgETvlPtLq WLqEeGNcT LcNNMCPnEv DGDest.tWL 



Basidio 
AVFAPPITAR 



10 



S-NT--P-L- VILSE-GNDT LDDNMCP-AG DSDPQ-N-WL 



201 



250 



P. involutus (phyAl) LNAAAPSvNL TDtDAfNLvs LCAF 1 TVS kE kkSdFCtLFE 
15 giPGsFeAFa 

P. involutus [phyA2) LNAAAPGANL TDaDAf NLvs LCPFmTVSkE qkSdFCtLFE 
giPGsFeAFa 



T. pubescens 
elQAE.dAFa 

20 A . pediades 
. tPEEFaqFe 

P . lyci i 
. t AEEYvSYe 



LNAGAPGANL TDtDTyNLlt LCPFETVAtE rrSeFCDIYE 
LNqqAPGANI TAaDvsNLip LCAFETIvkE tpSpFCNLF. 
LNAAAPSANL SDsDAltLmd MCPFDTLSsG naSpFCDLF . 



25 Basidio 
AF- 



LNAAAPGANL TD-DA-NL- - LCPFETVS-E --S-FCDLFK - -PEEF- 



251 



30 3 00 



P. involutus {phyAl) YgGDLDKFYG TGYGQeLGPV QGVGYVNELI ARLTnsAVRD 
NTQTNRTLDA 

P. involutus (phyA2) YaGDLDKFYG TGYGQALGPV QGVGYINELL ARLTnsAVnD 
NTQTNRTLDA 



35 T. pubescens 
HTQTNs TIiD S 

A. pediades 
NTQTNRTLDS 



YnADLDKFYG TGYGQPLGPV QGVGYINELI ARLTaQnVsD 
YfGDLDKFYG TGYGQPLGPV QGVGYINELL ARLTemPVRD 



P. lycii 
40 ETQTNRTLDS 



YyyDLDKYYG TGpGNALGPV QGVGYVNELL ARLTgQAVRD 



Modtaget PD 
1 2 JAN. 1999 



62 



Basidio 
KTQTNRTLDS 



Y-GDLDKFYG TGYGQPLGPV QGVGYINELL ARLT-QAVRD 



301 



350 



P. in vol ut us (phyAl) SPvTFPLNKT FYADFSHDN1 MVAVFSAMGL FrQPAPLsTS 
vPNPwRTWrT 

10 P. involutus (phyA2) APdTFPLNKT MYADFSHDN1 MVAVFSAMGL FrQSAPLsTS 
tPDPNRTWLT 



3\ pubescens 
tPDPaRTFLv 

A . pediades 
15 fPNPKRTWVT 

P. lycii 
kPDeNRlWVd 



SPeTFPLNRT LYADFSHDNQ MVAIFSAMGL FNQSAPLDPT 



SPlTFPLDRS IYADLSHDNQ MIAIFSAMGL FNQSSPLDPS 



dPaTFPLNRT FYADFSHDNt MVP I FAALGL FNaTA.LDPl 



Basidio 
20 PDPNRTWVT 



SP-TFPLNRT FYADF SHDNQ MVAIFSAMGL FNQSAPLDPS - 



351 

400 

25 P. involutus {phyAl) SsLVPFSGRM WERLsC. . f GT tkV 

RVLVQDqVQP 

P. involutus (phyA2) SsWPFSARM aVERLsC.a GT tkV 

RVLVQDqVQP 

T. pubescens kKIVPFSARM WERLdC . . g GA qsV 

30 RLLVNDAVQP 

A. pediades SRLtPFSARM VtERLlCqrd GTgsggpsri mrngnvqtfV 

RILVNDALQP 

P. lycii SKLVPFSGHM tVEKLaC sgkeaV 

RVLVNDAVQP 

35 

Basidio SKLVPFSARM WBRL-C GT V 

RVLVNDAVQP 



40 



401 



441 



Modtaget PD 
1 2 JAM. 1999 

-63- 

P. involutus {phyAD LEFCGGDrNG ICTLAkFVES QtFARsDGaG DFEKCFATSa - 
P. involutus (phyA2) LEFCGGDqDG ICALDkFVES QaYARsGGaG D FEKCLATTv - 

T. pubescens LAFCGADtsG vCTLDAFVES QaYARNDGEG DFEKCFAT 

A. pediades LKFCGGDiriDS 1CTLEAFVES QkYAREDGQG DFEKCFD 

P. lycii LEFCGG.vDG vCeLsAFVES QtYARENGQG DFAKCgfvPs e 

Baflidio LEFCGGD-DG -CTLDAFVES Q-VARBDGQG DPBKCFATP- - 



- 64- 



Modtaget PD 
2 2 JAN. 1999 



Figure 4 



50 

5 A. terreus 9al 

VPeDCHITFV 

A. terreus cbs 
VPdDCHITFV 



KhsdCNSVDh GYQCf PELSH kWGlYAPYFS LqDESPPPlD 
NhsdCtSVDr GYQCf PELSH kWGlYAPYFS LqDESPFPlD 



A. niger var. awamori NqsTCDTVDq GYQCf SEtSH LWGQYAPFFS LANES AI S PD 
10 VPaGCRVTFa 



A. niger NRRL3135 
VPaGCRVTFa 



NqsSCDTVDq GYQCf SEtSH LWGQYAPFFS LANESvISPE 



A. fumigatus 13073 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK 
LPkDCRITLV 

15 A. fumigatus 32722 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK 

LPkDCRITLV 

A. fumigatus 58128 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK 
LPkDCRITLV 

A. fumigatus 26906 GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK 
20 LPkDCRITLV 

A. fumigatus 32239 GSkACDTVE 1 GYQCsPGtSH LWGQYSPFFS LEDElSVSSD 
LPkDCRVTFV 



E. nidulans 
VPhGCeVTFV 

25 T . thermophi 1 us 

VPqNCKITFV 

T. lanuginosa 
VPkGCRVeFV 

M. thermophi la 
30 iPdDCeVTFa 

Basidio 
pPaGCQIxqV 



QNHSCNTaDG GYQCf PNVSH VWGQYSPYFS IEQESAISeD 
DSHSCNTVEG GYQCrPEISH SWGQYSPFFS LADQSEISPD 

nvDIAR hWGQYSPFFS LAEvSEISPA 

ESRPCDTpDl GFQCgTAISH FWGQYSPYFS VPsElDaS . . 
xSxPxrxtAA qLPipxQxqx xWSPYSPYFP VAxyxA. . . . 



35 



GCRVTFV 



VPKGCRVTFV 



Consensus NSHSCDTVDG GYQC-PEISH LWGQYSPFFS LADESAISPD VP- 
FcplO NSHSCDTVDG GYQCFPEISH LWGQYSPFFS LADESAISPD 



voatacet 
2 2 JAN. 1999 



A. terreus 9al 
5 QSYNYSLDSB 

A. terreus cbs 
KSYNYSMGSE 

A. niger var. awamori 
KTYNYSLGAD 

10 A. niger NRRL313 5 

KTYNYSLGAD 

A . fumigatus 13073 
KTYNYTLGAD 

A. fumigatus 32722 
15 KTYNYTLGAD 

A. fumigatus 58128 
KTYNYTLGAD 

A . fumigatus 26906 
KTYNYTLGAD 

20 A. fumigatus 32239 

ETYNYTLGAD 

E. nidulans 
ESYNYTLGAD 

T. the:rmophilus 
25 KdYrYqLGAN 

T, lanuginosa 
RdYaYhLGAD 

M. thermophila 
RTYDYTLGAD 

30 Basidio 
xnxtYxLGxD 



-65- 

51 

QVLARHGARs PThSKTKaYA AtlaAIQKSA TaFpGKYAFL 
QVLARHGARs PTdSKTKaYA AtlaAIQKNA TaLpGKYAFL 
QVL SRHGARY PTeSKGKKYS ALIeEIQQNv TtFDGKYAFL 
QVLSRHGARY PTdSKGKKYS ALIeEIQQNA TtFDGKYAFL 
QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL 
QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL 
QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL 
QVLSRHGARY PTSSKSKKYk kLVtAIQaNA TdFKGKFAFL 
QVLSRHGARY PTASKSKKYk kLVtAlQKNA TeFKGKFAFL 
QVLSRHGARY PTeSKSKaYS GLIeAIQKNA TsFwGQYAFL 
QLLSRHGARY PTSSKTE1YS qLIsrIQKtA TaYKGyYAFL 
QVLSRHGARY PTAhKSEvYA ELLqrIQDtA TeFKGDFAFL 
QVLSRHGARa PTlkRAasYv DLIdrlHhGA isYgPgYEFL 
NIIqRHGARF PTSGaAtRiq AaVakLQsax xxtDPKLDFL 



KTYNYTLGAD 



35 

KTYNYTLGAD 



Consensus QVLSRHGARY PTSSKSKKYS ALI -AIQKNA T - FKGKYAFL 
FcplO QVLSRHGARY PTSSKSKKYS ALIEAIQKNA TAFKGKYAFL 



-66- 



Modtaget PD 
2 2 JAN. 1SS9 



150 

A. terreus 9al 
VhESAEKFVE 

A. terreus cbs 
VhESAEKFVE 

A. niger var. awamori 
VIASGEKFIE 

A. niger NRRL3135 
VIASGKKFIE 

A. fumigatus 13073 
VIASGEKFIE 

A. fumigatus 32722 
VIASGEKFIE 

A. fumigatus 58128 
VIASGEKFIE 

A. fumigatus 26906 
VIASGEKFIE 

A. fumigatus 32239 
VIASGEKFIE 

K. nidulans 
WASAEKFIN 

T, thennophi I us 
VIASGrlFIE 

r. lanuginosa 
VIASAEf Fnr 

M . th ermoph ila 
WhSAENFtQ 

Basidio 
WDSAtNWtA 



Consensus 

VIASAEKFIE 

PcplO 

VIASAEKFIE 



101 

ELTPFGrNQL rDlGaQFYeR 
NLTPFGrNQL qDlGaQFYRR 
DLTPFGEQEL VNSGIKFYQR 
DLTPFGEQEL VNSGIKFYQR 
DLTPFGEQQL VNSGIKFYQR 
DLTPFGEQQL VNSGIKFYQR 
DLTPFGEQQL VNSGIKFYQR 
DLTAFGEQQL VNSGIKFYQR 
DLTPFGEQQM VNSGIKFYQR 
DLTiFGENQM VDSGaKFYRR 
DLTPFGENQM IQlGIKFYnH 
NLTRFGEEQM MESGrQFYHR 
ELTRtGQQQM VNSGIKFYRR 
DLvPFGAxQs sQAGqEaFtR 

DLTPFGEQQM VNSGIKFYRR 
DLTPFGEQQM VNSGIKFYRR 



YNAL.TRhln PFVRATD As R 
YDTL.TRhln PFVRAADSsR 
YESL.TRnll PFIRSSGSsR 
YESL.TRnlV PFIRSSGSsR 
YKAL.ARsW PFIRASGSDR 
YKAL.ARsW PFIRASGSDR 
YKAL.ARsW PFIRASGSDR 
YKAL.ARsW PFIRASGSDR 
YKAL.AgsW PFIRSSGSDR 
YKnL.ARknt PFIRASGSDR 
YKSL.ARnaV PFVRCSGSDR 
YREq.ARelV PFVRAAGSAR 
YRAL.ARksI PFVRTAGqDR 
YsxLvSxdnL PFVRASGSDR 

YKAL-AR-IV PFVRASGSDR 
YKAL. ARKIV PFVRASGSDR 



Modtaget PD 
22 JAN. 1999 



-67- 



200 

A. terreus 9al 
5 TAFEs. . .St 

A. terreus cbs 
TAFEa. . .St 



151 

GFQTARqDDh hAnphQPSPr VDValPEGsA YNNTLEHSLC 
GFQNARqGDP hAnphQPSPr VDWIPEGtA YNNTLEHSIC 



A . niger var . awamori GFQSTKLkDP rAqpgQSSPk IDWISEAsS sNNTLDpGtC 
TvFEd. . .SE 



10 A. niger NRRL313 5 

TvFEd. . .SE 



GFQSTKLkDP rAqpgQSSPk IDWISEAsS sNNTLDpGtC 



A. fumigatus 13073 GFQqAKLADP gAt. nRAAPa ISVIIPESeT FNNTLDHGVC 
TkFEa . . . SQ 

A. fumigatus 32722 GFQqAKLADP gAt . nRAAPa ISVIIPESeT FNNTLDHGVC 
15 TkFEa. . .SQ 

A. fumigatus 58128 GFQqAKLADP gAt. nRAAPa ISVIIPESeT FNNTLDHGVC 
TkFEa. . .SQ 

A. fumigatus 26906 GFQqAKLADP gAt. nRAAPa ISVIIPESeT FNNTLDHGVC 
TkFEa . . . SQ 

20 A. fumigatus 32239 GFQqANVADP gAt . nRAAPV ISVIIPESeT YNNTLDHSVC 

TnFEa. . .SE 



E. nidulans 
vSFEn. . .dE 

T . th ermophi lus 
25 PvFEd. . .Ss 

T. lanuginosa 
PAaEe . . . Ap 

M, thermophila 
TAFEegPySt 

30 Basidio 
PxAG 



GFRkAQLhDh g.s.gQATPV VNVIIPEidG FNNTLDHStC 
GFQSAKV1DP hSdkhDAPPt INVIIeEGpS YNNTLDtGsC 
GFQdAKdrDP rSnkdQAePV INVIISEEtG sNNTLDgltC 
GFHSA1LADR gStvrPTlPy dmWIPETaG aNNTLHNDLC 



GFaxA. 



. sxntxxPx LxVILSExg. . NDTLDDNMC 



35 



TAFE--P-SE 



TAFEB. . .SE 



Consensus GFQSAKLADP -A QASPV INVIIPEG-G YNNTLDHGLC 



FcplO GFQSAKLADP GANPHQASPV XNVIIPEGAG YNNTLDHGLC 



250 



201 
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Modtaget PD 
2 2 JAN. 1999 



A. terreus 9a 1 
MCPFETVS1T 

A . terreus cbs 
MCPFETVS1T 



VGDD a vANFT AVFAPAIaqR LEAdLPGVQL StDDWNLMA 
VGDAaADNFT AVFAPAIakR LEAdLPGVQL SADDWNLMA 



5 A . niger var. awamori LADtVEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD 

MCSFDTIStS 



A. niger NRRL3135 
MCSFDTIStS 



LADtVEANFT AtFvPSIRqR LEndLSGVtL TDtEVtyLMD 



A, fumigatus 13073 LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD 
10 MCSFDTVArT 

A. fumigatus 32722 LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD 
MCSFDTVArT 

A, fumigatus 58128 LGDEVAANFT ALFAPdIRAR aEkhLPGVtL TDEDWSLMD 
MCSFDTVArT 

15 A. fumigatus 26906 LGDEVAANFT ALFAPdIRAR aKkhLPGVtL TDEDWSLMD 

MCSFDTVArT 

A. fuxnigatus 32239 LGDEVEANFT ALFAPAIRAR IEkhLPGVQL TDDDWSLMD 
MCSFDTVArT 



E. nidulans 
20 MCS FDTMArT 

T. thermophilus 
LCPFETLArn 

T. lanuginosa 
LCPFDTVGsd 

25 M. thermophila 

LCPFETVAsS 

Basidio 
LCPFETVS . . 



r ADE I EANFT AIMGPPIRkR LEndLPGIKL TNENVTyLMD 
gGHDaQEKFA kqFAPAIIEK IKDhLPGVDL AvsDVpyLMD 
.DptqpAEFl qVFGPRVlkK ItkhMPGVNL TIEDVplFMD 
IGDDaQDtYl StFAGPItAR VNAnLPGaNL TDADtVaLMD 
dSDpqxnxWl AVFAPP 1 1 AR LNAaaPGaNL TDxD axNLxx 



30 Consensus LGDDVEANFT AVFAPPIRAR LEA - LPGVNL TDEDWNLMD 

MCPFDTVA-T 



MCPFDTVART 



FcplO LGDDVEANFT AVFAPPIRAR LEAHLPGVNL TDEDWNLMD 



35 



300 

A. terreus 9al 
dKYYGYGGGN 



251 



dD. .Aht LSPF CDLFTa . . tE WtQYNYLlSL 



Modtaget PD 
7 2 JAN. 1999 

-69- 

A. terreus cbs dD. .Aht LSPF CDLFTa . . aE WtQYNYLlSL 

dKYYGYGGGN 

A. niger var. awamori Tv. .DTK LSPF CDLFTH. .dE WiHYDYLQSL 

kKYYGHGAGN 

5 A. niger NRRL3135 Tv. .DTK LSPF CDLFTH. .dE WiNYDYLQSL 

kKYYGHGAGN 

A. fumigatus 13073 SD. .ASQ LSPF CQLFTH. .nE WkKYNYLQSL 

gKYYGYGAGN 

A. fumigatus 32722 SD . .ASQ LSPF CQLFTH.. nE WkKYNYLQSL 

10 gKYYGYGAGN 

A. fumigatus 58128 SD. .ASQ LSPF CQLFTH .. nE WkKYNYLQSL 

gKYYGYGAGN 

A. fumigatus 26906 SD. .ASQ LSPF CQLFTH. .nE WkKYNYLQSL 

gKYYGYGAGN 

15 A. fumigatus 32239 AD. .ASE LSPF CAIFTH . . nE WkKYDYLQSL 

gKYYGYGAGN 

E. nidulans AH. .GTE LSPF CAIFTE . . kE W1QYDYLQSL 

SKYYGYGAGS 

T. thermophilus ht..DT LSPF CALsTQ. . eE WqaYDYYQSL 

20 gKYYGnGGGN 

T. lanuginosa PvlfPrQ LSPF CHLFTa. .dD WmaYDYYyTL 

dKYYSHGGGS 

M. thermophila SsdpATadag ggngrpLSPF CrLFSE..sE WraYDYLQSV 

gKWYGYGPGN 

25 Basidio xexxSxF CDLFexxpeE FxaFxYxgdL 

dKFYGtGyGQ 

Consensus SD--ATQ LSPF CDLFTH E W-QYDYLQSL - 

KYYGYGAGN 

30 FcplO SD..ATQ LSPF CDLFTH. .DE WIQYDYLQSL 

GKYYGYGAGN 

301 

350 

35 A. terreus 9al PLGPvQGVGW aNELMARLTR A.PVHDHTCv NNTLDAS PAT 

FPLNATLYAD 

A. terreus cbs PLGPvQGVGW aNELIARLTR S . PVHDHTCv NNTLDANPAT 

FPLNATLYAD 
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Modtaget PD 
? 2 JAN, 1999 



A. niger var. awamori PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDSNPAT 
FPLNSTLYAD 



A. niger NRRL313 5 
FPLNSTLYAD 



PLGPTQGVGY aNELIARLTH S.PVHDDTSS NHTLDS S PAT 



5 A. fumigatus 13073 PLGPAQGIGF tNEL I ARLTR S . PVQDHTST NsTLvSNPAT 

FPLNATMYvD 

A. fumigatus 32722 PLGPAQGIGF tNEL I ARLTR S . PVQDHTST NsTLvSNPAT 
FPLNATMYvD 

A. fumigatus 58128 PLGPAQGIGF tNELIARLTR S . PVQDHTST NsTLvSNPAT 
10 FPLNATMYvD 

A. fumigatus 2 6906 PLGPAQGIGF tNELIARLTR S . PVQDHTST NsTLvSNPAT 
FPLNATMYvD 

A. fumigatus 322 3 9 PLGPAQGIGF tNELIARLTN S . PVQDHTST NsTLDSDPAT 
FPLNAT I YvD 



15 E. nidulans 

FPLDrkLYAD 

T. thermophilus 
FPLNATLYAD 

T. lanuginosa 
20 FPLDAvLYAD 

M. thermophila 
FPLGrPLYAD 

Basidio 
FPLNrTFYAD 



PLGPAQGIGF tNELIARLTQ S . PVQDNTST NHTLDSNPAT 
PLGPAQGVGF vNELIARMTH S . PVQDYTTv NHTLDSNPAT 
AFGPSRGVGF vNEL I ARMTg NlPVKDHTTV NHTLDdNPET 
PLGPTQGVGF vNELLARLA. GvPVRDgTST NRTLDGDPrT 
PLGPvQGVGY iNELLARLTx qa . VRDNTqT NRTLDSSPxT 



25 



FPLNATLYAD 



FPLNATLYAD 



30 



Consensus PLGPAQGVGF -NELIARLTH S- PVQDHTST NHTLDSNPAT 



FcplO PLGPAQGVGF VNSLIARLTH S. PVQDHTST NHTLDSNPAT 



351 



400 



A. terreus 9al 
AAWTVPFAAR 

35 A. terreus cbs 

AAWTVPFAAR 



FSHDSnLVSI FWALGLYNGT aPLSqTSVE . . SvsQTDGYA 
FSHDSnLVSI FWALGLYNGT kPLSqTTVE . . ditrTDGYA 



A. niger var. awamori FSHDNGIISI LFALGLYNGT kPLSTTTVE. . NitQTDGFS 
SAWTVPFASR 



Modtaget PD 



- 71 - 2 2 JAN. 1999 

A. niger NRRL313 5 
SAWTVPFASR 

A . fumigatus 13073 
ASWvVPFGAR 

5 A . fumigatus 32722 

ASWvVPFGAR 

A . fumigatus 5812 8 
ASWvVPFGAR 

A. fumigatus 26906 
10 ASWvVPFGAR 

A* fumigatus 32239 
ASWAVPFGAR 

E. nidulans 
ASWTVPFGAR 

15 r. thermophilus 

AAWTVPFGGR 

T. lanuginosa 
ASWTVPFAAR 

M . t h ermoph i 1 a 
20 ASWAVP FAAR 

Basidio 
TSklVPFSAR 



Consensus FSHDNTMVS I FFALGLYNGT -PLSTTSVEP -S-EETDGYA 

25 ASWTVPFAAR 

FcplO FSHDNTMVSI FFALGLYNGT KPLSTTSVE. . SIEETDGYA 

ASWTVPFAAR 



401 

30 450 



A. terreus 9al AYVEMMQC. . ra EKEPL VRVLVNDRVM 

PLHGCPtDKL 

A. terreus cbs AYIEMMQC. . ra EKQPL VRVLVNDRVM 

PLHGCAVDNL 

35 A. niger var. awamori 1YVEMMQC. . Qa EQEPL VRVLiVNDRW 

PLHGCPIDaL 

A. niger NRRL3135 1YVEMMQC . . Qa EQEPL VRVLVNDRW 

PLHGCPVDaL 



FSHDNGIISI LFALGLYNGT kPLSTTTVE . .NitQTDGFS 
FSHDNSMVSI FFALGLYNGT ePLSrTSVE. .SaKElDGYS 
FSHDNSMVSI FFALGLYNGT gPLSrTSVE. . SaKElDGYS 
FSHDNSMVSI FFALGLYNGT ePLSrTSVE. .SaKElDGYS 
FSHDNSMVSI FFALGLYNGT ePLSrTSVE. . SaKElDGYS 
FSHDNGMIPI FFAMGLYNGT ePLSqTSeE. . StKESNGYS 
FSHDNSMISI FFAMGLYNGT qPLSmdSVE. . SiQEmDGYA 
FSHDNTMtSI FaALGLYNGT akLSTTelK. . SiEETDGYS 
FSHDNTMtGI FsAMGLYNGT kPLSTSklQP pTgAAADGYA 
FSHDNdMMGV LgALGaYDGv pPLdkTA. .R rdpEElGGYA 
FSHDNqMVAI FsAMGLFNqS aPLdPSxpDP nrt Wv 



Modtaget PD 
? 2 JAN. 13S9 

- 72- 



A . fumigatus 13073 AYfEtMQC. Ks EKEPL VRaLINDRW 

PLHGCDVDKIj 

A. fumigatus 32722 AYfEtMQC. . Ks EKEPL VRaLINDRW 

PLHGCDVDKL 

5 A. fumigatus 58128 AYfEtMQC. Ks EKESL VRaLINDRW 

PLHGCDVDKL 

A. fumigatus 26906 AYfEtMQC. . Ks EKEPL VRaLINDRW 

PLHGCDVDKL 

A. fumigatus 32239 AYfEtMQC. . Ks EKEPL VRaLINDRW 

10 PLHGCAVDKL 

E. nidulans AYfELMQC. . E KKEPL VRVLVNDRW 

PLHGCAVDKF 

X. thermophilus AYIEMMQC. . Dd SDEPV VRVLVNDRW 

PLHGCEVDsL 



15 T. lanuginosa AYVELLRC. . Etetsseeee EG. . .EDEPF VRVLVNDRW 

PLHGCrVDRW 

M, thermophila i YVEkMRC . . sggggggggg EGrqeKDEeM VRVLVNDRVM 

TLkGCGaDEr 

Basidio mvVErLxCxx xgtxxxxxxx xxxxxxxxxx VRVLVNDaVq 

20 PLEfCGgDxd 



Consensus AYVEMMQC- - E EG EKEPL VRVLVNDRW 

PLHGCGVDKL 



25 PLHGCGVDKL 



FcplO AYVEMMQC. . KA 



EKEPL VRVLVNDRW 



A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger NRRL3135 

A. fumigatus 13073 

A . fumigatus 32722 

A. fumigatus 58128 

A. fumigatus 26906 

A. fumigatus 32239 

£7. nidulans 

T. thermophilus 

T. lanuginosa 

M. thermophila 

Basidio 



- 73- 

451 

GRCKrDAFVA GLSPAQAG. . 
GRCKrDDFVE GLSFARAG . . 
GRCtrDsFVr GLSFARSG . . 
GRCtrDsFVr GLSFARSG. . 
GRCK1NDFVK GLSWARSG. . 
GRCK1NDFVK GLSWARSG . . 
GRCKlNDFVK GLSWARSG . . 
GRCK1NDFVK GLSWARSG. . 
GRCK1KDFVK GLSWARSG. . 
GRCtlDDWVE GLNFARSG . . 
GRCKrDD FVr GLSFARqG . . 
GRCRrDEWIK GLTFARqG. . 
GmCtlErFIE SMAFARGN . . 
GxCtlDAFVE SqxYAReDgq 



Modtaget PD 
. 2 2 JAN, 1383 



482 

GNWADCF — 

GNWAECF — 

GDWAECsA — — 
GDWAECFA — — 
GNWGECFS — — 
GNWGECFS — — 
GNWGECFS — — 
GNWGECFS — — 
GNSEQSFS — — 
GNWKtCFTl ~ — 
GNWEGCYAas e- 

GHWDrCF — 

GKWD1CFA — — 
GDFEKCFAtp XX 



Consensus GRCK-DDFVE GLSFARSG- - GNWEECFA- - 



FcplO GRCKRDDFVE GLSFARSG.. GNWEECFA. 
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Modtaget PD 
22 JAM, 1939 



Figure 5 

CP-l 

ECO RI MGVFVVLIiS IATLFG S T 17 
TATATCAATTCAIfiGGCOTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 
1 + + + + + + 6Q 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTALGPRGNSHS CDTVDGG 37 

CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

61 + + + + + + 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3.10 

YQCFPEISHLWGQYSP£FSL57 
GTTAGCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATTCTTCTCTT 

121 + + + + + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTAAGAAGAGAA 

^DESAISPDVP£<3CRVTFVQ77 

TGGCTGACGAATCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTC 
181 + + + + + + 

ACCGACTGCTTAGACGATAAAGAGGTCTGCAAGGTTTCCCGACATCTCAATGAAAGCAAG 

CP-4.10 

CP-5,1Q 

VLSRHGARYPTSSKSK£YSA97 

AAGTTTTGTCTAGACACGGTGCTAGATACCCj^CTTCTTCTAAGTCTAAGAAGTACTCTG 

241 + + + + + + 3Q0 

TTCAAAACAGATCTGTGCCACGATCTATGGGTTGAAGAAGATTCAGATTCTTCATGAGAC 

LIEAIQKNATAFKGKYAFLK 117 
CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 



ll JAN, 1399 



-75- 

301 + + + + + + 360 

GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7.10 

5 TYNYTLGADDLTPFGEQQMV 137 

AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGG 

361 + + + + + + 420 

TCTGAATGTTGATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACC 

10 NSGIKFYRRYKALARKIVPF 157 

TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

421 + + + + + + 480 

AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

CP-9.1Q 

15 CP-9.1Q 

yRASGSDRVIASAEKFIEGF 177 
TCGTTAGAGCTTCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 

481 + + + + + + 540 

AGCAATCTCGAAGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAA 

20 

QSAKLADPGAHPHQASPVIN 197 
TCCAATCTGCTAAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTA 

541 + + + + + + 600 

AGGTTAGACGATTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAAT 

25 CP- 10. 10 

CP-11.10 

VI I PEGAGYNNTLDHGfcCTA 217 
ACGTTATTATTCCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTG 

601 + + + + + + 660 

30 TGCAATAATAAGGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGAC 



FEESELGDDVEANFTAYFAP 237 
CTTTCGAAGAATCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTC 



661 



Modtaget hu 

... . on 

2 2 JAN, 1999 

-76- 

+ + + + + + 720 

GAAAGCTTCTTAGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAG 

CP-12.10 

PIRARLEAfiLPGVELTDEDV 257 
CACCTATTAGAGCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACG 



721 



+ 780 



10 



GTGGATAATCTCGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGC 

CP-13 . 1 0 

VHIiMDMCPFETVARTSDATQ 277 
TTGTTAACTTGATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTC 



781 



■+ 840 



15 



AACAATTGAACTACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAG 

LSPFCDLFTHDEWIQYDYLQ 297 
AATTGTCTCCATTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGC 



841 



-+ 900 



20 



TTAACAGAGGTAAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACG 

CP-14,1Q 

CP-15,1Q 

SLGKYYGYGAGNPLGPAQGV 317 
AATCTTTGGGTAAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTG 



901 



- + 960 



25 



TTAGAAACCCATTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCAC 



961 



30 1020 



GF^NELIARLTHSPVQDHTS 337 
TTGGTTTCGTTAACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTT 

+ + + + + + 

AACCAAAGCAATTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAA 

CP-16.10 

CP-17.1Q 



2 2 JAN. 1939 

- 77- 

TNHTLDSNPATFPLNATLYA 357 

CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 
1021 + + + + + + 

1080 

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNXMYSIFFALGLYNG 377 

CTGACTTCTCTCACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

1081 + + + + + + 

1140 

GACTGAAGAGAGTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCAAACATGTTGC 

CP-18. 10 

CP-19.10 

TEPLSTTSVESIEETDGY&A 397 

GTACTAAGCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACGCTG 

1141 + + + + + + 

1200 

CATGATTCGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGCGAC 

SWTVPFAARAYVEMMQCfiAE 417 

CTTCTTGGACTGTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTG 

1201 + + + + + + 

1260 

GAAGAACCTGACAAGGTAAGCCACGATCTCGAATGCAACTTTACTACGTTACACTTCGAC 

CP-20.10 

CP-21.10 

KEPLVRVLVNDRVVPLHGCfi 437 

AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 
1261 + + + + + + 

1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 

VDKLGRCKRDDFVEGLS FAR 457 
GTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTA 



1321 

1380 



Modtaget PD 
22 JAM. 1999 

- 78- 

+ + + + + + 

CACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCOAT 

CP-22.1Q 

SGGNWEECFA* Eco RI 467 
GATCTGGTGGTAACTGGGAAGAATGTTTCGCTTAAGAATTCATATA 

1381 + + - — + + "26 

CTAGACCACCATTGACCCTTCTTACAAAGCGAATTCTTAAGTATAT 



10 



Figure 6 



50 



15 P. involutus {phyAl) 
pPaGCQInqV 

P. involutus {phyA2) 
pPaGCelnqV 

T. pubescens 
20 pPaSCQInqV 

A . pediades 
pPKDCKITqV 

P. lycil 
pPEGCtVTqV 

25 A . terreus 9al 
VPEDCHITFV 

A. terreus cbs 
VPDDCHITFV 

A. niger var. awamori 
30 VPaGCRVTFa 

A . nigrer T213 
VPaGCRVTFa 

A . niger NRRL313 5 
VPaGCRVTFa 

35 A. fumigratus ATCC13073 
LPKDCRITLV 



-FPipeseqR nWSPYSPYFP LAEykA. 



-FsipeseqR nWSPYSPYFP IiAEykA. 



-LDvtRDVqQ sWSmYSPYFP aAtyvA. 



-pffpPQIqD sWAaYTPYYP VqAyTP . 



-LPipAQnTs nWGPYdPFFP VEpyAA. 



KhsdCNSVDh GYQCf PELSH kWGlYAPYFS LqDESPFPID 



NhsdCtSVDr GYQCf PELSH kWGlYAPYFS LqDESPFPID 



NqsTCDTVDq GYQCf SEtSH LWGQYAPFFS LANES AISPD 



NqsSCDTVDq GYQCf SEtSH LWGQYAPFFS LANESvISPD 



NqsSCDTVDq GYQCf SEtSH LWGQYAPFFS LANESvISPE 



GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDE1SVSSK 



Modtaget PD 
2 2 JAM, 1999 
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A. fumigatus ATCC32722 
LPKDCRITLV 



GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK 



A. fumigatus ATCC5 812 8 
LPKDCRITLV 



GSkSCDTVDl GYQCsPAtSH LWGQYSPFFS LEDElSVSSK 



5 A . fumigatus ATCC26906 
LPKDCRITLV 



GSkSCDTVDl. GYQCsPAtSH LWGQYSPFFS LEDElSVSSK 



A. fumigatus ATCC3223 9 
LPKDCRVTFV 



GSkACDTVEl GYQCsPGtSH LWGQYSPFFS LEDE1SVSSD 



E. nidulans 
10 VPhGCeVTFV 



QNHSCNTaDg GYQCf PNVSH VWGQYSPYFS IEQESAISeD 



T . th ermophi lus 
VPQNCKITFV 



DSHSCNTVEg GYQCrPEISH sWGQYSPFFS LADQSEISPD 



T. lanuginosa 
VPKGCRVeFV 



-nvDIAR hWGQYSPFFS LAEvSEISPA 



15 M. thermophi la 
I PDDCe VTFa 



ESRPCDTpDl GFQCgTAISH FWGQYSPYFS VPsElDaS . . 



Consensus Seg. 11 
VPKGCRVTFV 



NSHSCDTVD- GYQC-PHISH LWGQYSPFFS LADE SAT SPD 



20 



100 
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P . in vol u tus ( phyAl ) 
KSFKYdLGns 



NIIqRHGARF PTSGaTtRik AgLtKLQgvq nftDAKFnFI 



25 P. involutus {phyA2) 
KSFtYdLGTs 



NIIqRHGARF PTSGaAtRik AgLsKLQsvq nftDPKFDFI 



T, puhescens 
t nY t YSLGqD 

A. pediades 
30 tnYtYTLGhD 



HI IqRHGARF PTSGaAKRiq TaVAKLKaaS ny t DP 1 LAFV 



NIIqRHGARF PTSGaGtRiq AaVKKLQsak TytDPRLDFL 



P. lycii 
NdFvYkFGvA 



NLIqRHGARW PTSGarsRqv AaVAKIQmar PftDPKYEFL 



A . terreus 9al 
QSYNYSLDSE 



QVLARHGARs PThSKTKaYA AtlAalQKSA TaFpGKYAFL 



35 A. terreus cbs 
KSYNYSMGSE 



QVLARHGARs PTdSKTKaYA AtlAalQKNA TaLpGKYAFL 



A . niger var. awamori QVLSRHGARY PTeSKGKKYS ALIEelQQNv TtFDGKYAFL 
KTYNYSLGAD 



Modtaget PD 
2 2 JAN. 19S9 



A . niger T213 
KTYNYSLGAD 



-80- 

QVLSRHGARY PTe S KGKKYS ALIEelQQNv TtFDGKYAFL 



A . niger NRRL313 5 
KTYNYSLGAD 



QVLSRHGARY PTdS KGKKYS ALIEelQQNA TtFDGKYAFL 



5 A. fumigatus ATCC13073 
KTYNYTLGAD 



QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL 



A . fumigatus ATCC32722 
KTYNYTLGAD 



QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL 



A. fumigatus ATCC58128 
10 KTYNYTLGAD 



QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL 



A. fumigatus ATCC26906 
KTYNYTLGAD 



QVLSRHGARY PTSSKSKKYk kLVtalQaNA TdFKGKFAFL 



A. fumigatus ATCC32239 
ETYNYTLGAD 



QVLSRHGARY PTASKSKKYk kLVtalQKNA Te FKGKFAFL 



15 E. nidulans 
ESYNYTLGAD 



QVLSRHGARY PTeSKSKaYS GLIEalQKNA TsFwGQYAFL 



T. thermophilus 
Kd Y r YqLGAN 

T. lanuginosa 
20 RdYaYhLGAD 



QLLSRHGARY PTSSKTE1YS qLTsRIQKtA TaYKGyYAFL 



QVLSRHGARY PTAhKSEvYA ELLQRIQDtA TeFKGDFAFL 



M. thermophila 
RTYDYTLGAD 



QVLSRHGARa PTlkRAasYv DLIDRIHhGA isYgPgYEFL 



Consensus Seq. 11 
25 KTYNYTLGAD 



QVLSRHGARY PTSSKSKKYS ALIBRIQKNA T-FKGKYAFL 



150 



101 



P. involutus {phyAl) 
30 WDSAtNWtA 



DLvPFGAaQs f DAGqEaFaR YskLvSKNnL PFIRAdGSDR 



P. involutus iphyA2) 
WDTAtNWtA 



DLvPFGAaQs f DAGLEvFaR YskLvSsDnL PFIRSdGSDR 



T. pubescens 
WATANNWtA 



sLveLGAtQB sEAGqEaFtR YsSLvSaDeL PFVRASGSDR 



35 A. pediades 
WDSAtNWtE 



DLvPFGAlQs sQAGeEtFQR Ysf LvSKEnL PFVRASSSNR 



P. lycii 
WDSStNWtA 



DLlPFGANQs hQTGtDMYtR YsTLf EgGdV PFVRAAGdQR 



Modtaget PD 
2 2 JAN, 1999 
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A. terreus 9al 
VhESAEKFVE 



ELTPFGrNQL rDlGaQFYeR YNAL.TRHIn PFVRATDAsR 



A. terreus cbs 
VhESAEKFVE 



NLTPFGrNQL qDlGaQFYRR YDTL . TRHIn PFVRAADSsR 



5 A. niger var. awamori 
VIASGEKFIE 



DLTPFGEQEL VNSGIKFYQR YESL.TRNII PFIRSSGSsR 



A . niger T213 
VIASGEKF IE 



DLTPFGEQEL VNSGIKFYQR YESL.TRNII PFIRSSGSsR 



A. niger NRRL3135 
10 VIASGKKFIE 



DLTPFGEQEL VNSGIKFYQR YESL.TRNIV PFIRSSGSsR 



A. fumigatus ATCC13073 
VIASGEKFIE 



DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR 



A. fumigatus ATCC32 722 
VIASGEKFIE 



DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR 



15 A. fumigatus ATCC58128 
VIASGEKFIE 



DLTPFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR 



A. fumigatus ATCC26906 
VIASGEKFIE 



DLTAFGEQQL VNSGIKFYQR YKAL.ARSW PFIRASGSDR 



A. fumigatus ATCC3223 9 
20 VIASGEKFIE 



DLTPFGEQQM VNSGIKFYQK YKAL.AgSW PFIRSSGSDR 



E, nidulans 
WASAEKFIN 



DLTi FGENQM VDSGaKFYRR YKnL.ARKnt PFIRASGSDR 



T. thermophilus 
VIASGrlFIE 



DLTPFGENQM IQlGIKFYnH YKSL.ARNaV PFVRCSGSDR 



25 T. lanuginosa 
VIASAEfFnr 



NLTRFGEEQM MESGrQFYHR YREq . ARE IV PFVRAAGSAR 



M. thermophila 
WhSAENFtQ 



ELTRtGQQQM VNSGIKFYRR YRAL.ARKsI PFVRTAGqDR 



30 Consensus Seq. 11 
VIASAEKFIE 



DLTPFGENQM VNSGIKFYRR YKAL-ARNTV PFVRASGSDR 



200 



151 



35 P. involutus (phyAl) GFaSA. 
PAaGD 



.shNtvqPk LNLILPQ..T gNDTLEDNMC 



P. involutus {phyA2) GFaSA srNaiqPk LDLILPQ. . T gNDTLEDNMC 

PAaGE 



Modtaget PD 
2 2 JAN. 1999 
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T. pubescens 
PAaGD 

A. pediades 
PnaGs 

P. lycii 
PnevD 

A . terreus 9al 
TAFEs . . . ST 

A. terreus cbs 
TAFEA ... ST 

A. niger var. awamori 
TvFED. . .Se 

A. niger T213 
TvFED. . .Se 

A . niger NRRL3135 
TvFED. . .Se 

A. .fumigatus ATCC13073 
TkFEA. . .Sq 

A. fumigatus ATCC32722 
TkFEA. . .Sq 

A. fumigatus ATCC58128 
TkFEA. . .Sq 

A. fumigatus ATCC26906 
TkFEA. . .Sq 

A. fumigatus ATCC3223 9 
TnFEA . . . Se 

E. nidulans 
vSFEN. . . de 

T. thermophi lus 
PvFED . . .SS 

T. lanuginosa 
PAaEE. . ,AP 

M. thermophi la 
TAFEEgpyST 



GFalA ssNsiTPV LSVIISE. .A gNDTLDDNMC 



GFsAA . 



. shHvlNPI Lf VILSE . . S LNDTLDDAMC 



GFgdA sgEtvlPt LQWLQE..E gNcTLcNNMC 



GFQTARqDDh hAnpHQPSPr VDValPEGSA YNNTLEHSLC 



GFQNARqGDP hAnpHQPSPr VDWIPEGTA YNNTLEHSIC 



GFQSTKLkDP rAqpgQSSPk IDWISEASS sNNTLDpGtC 



GFQSTKLkDP rAqpgQSSPk IDWISEASS SNNTLDpGtC 



GFQSTKLkDP rAqpgQSSPk IDWISEASS SNNTLDpGtC 



GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC 



GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC 



GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC 



GFQqAKLADP gAt.NRAAPa ISVIIPESeT FNNTLDHGVC 



GFQqANVADP gAt.NRAAPV ISVIIPESeT YNNTLDHSVC 



GFRkAQLhDh g.s.gQATPV VNVIIPEidG FNNTLDHStC 



GFQSAKV1DP hSdkHDAPPt INVIIeEGPS YNNTLDtGsC 



GFQdAKdrDP rSnkDQAePV INVIISEETG sNNTLDgltC 



GFHSA1LADR gStvRPTlPy dmWIPETAG aNNTLHNDLC 



Consensus Seq. 11 
TAPED ST 



GFQSAKLADP -A--HQASPV INVIIPEGSG YNNTLDHGLC 



250 

P. involutus (phyAl) 
LCAFlTVSK. 

5 P. involutus {phyA2) 
LCPFmTVSK. 

T. pubescens 
LCPFETVAt . 

A. pediades 
10 LCAFETIvK. 

p . lyci i 

MCPFDTLSs . 

A, terreus 9al 
MCPFETVS1T 

15 A . terreus cbs 
MCPFETVS1T 

A. niger var. awajnori 
MCSFDTIStS 

A. niger T213 
20 MCSFDTIStS 

A. niger NRRL3135 
MCSFDTIStS 

A. fumigratus ATCC13073 
MCSFDTVART 

25 A. fumlgratus ATCC32722 
MCSFDTVART 

A. fumlgatus ATCC58128 
MCSFDTVART 

A. fmnigatus ATCC26906 
30 MCSFDTVART 

A. fumigatus ATCC32239 
MCSFDTVART 

E . nidulans 
MCSFDTMART 

35 T. thermophilus 
LCPFETLARn 

T. lanuginosa 
LCPFDTVGsd 



22 JAN. 1999 

-83- 

201 

.SDpqvnaWl AVafPSItAR LNAaaPSVNL TDtDaf NLVs 
.SDpqvDaWl AsafPSVtAQ LNAaaPGaNL TDADa f NLVs 
.SDpqvnQWl AqFAPPMtAR LNAga PGaNL TDtDtyNLLt 
.SDpqtGiWT SIYGTPIanR LNqqaPGaNI TAADVsNLIp 
.GDESt.tWl GVFAPnltAR LNAaaPSaNL SDsDaLtLMD 
VGDDAvANFT AVFAPAIaqR LEAdLPGVQL StDDWNLMA 
VGDAAADNFT AVFAPAIakR LEAdLPGVQL SADDWNLMA 
LADtvEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD 
LADtvEANFT AtFAPSIRqR LEndLSGVtL TDtEVtyLMD 
LADtvEANFT AtFvPSIRqR LEndLSGVtL TDtEVtyLMD 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWS LMD 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWS LMD 
LGDEvAANFT ALFAPdIRAR aEkhLPGVtL TDEDWS LMD 
LGDEvAANFT ALFAPdIRAR aKkhLPGVtL TDEDWSLMD 
LGDEvEANFT ALFAPAI RAR IEkhLPGVQL TDDDWSLMD 
r ADE i EANFT AIMGPPIRkR LEndLPGIKL TNENVIyLMD 
gGHDAQEKFA kqFAPAIlEK IKDhLPGVDL AvsDVpyLMD 
. DptqpAEFl qVFGPRVlkK ItkhMPGVNL TlEDVplFMD 



2 2 JAN. 19S9 



84 



M . th ermoph i 1 a 
LCPFETVAsS 



IGDDAQDtYl StFAGPItAR VNAnLPGaNL TDADtVaLMD 



Consensus Seg. 11 
5 MCPFDTVART 



LGDDAEANFT AVFAPPIRAR LEA- LPGVNL TDEDWKLMD 



251 

300 

P. involutus iphyAl) ekkSdF CtLFegiPGs FeaFAYggdL 

10 dKFYG tGyGQ 

P. involutus (phyA2) eqkSdF CtLFegiPGs FeaFAYagdL 

dKFYGtGyGQ 

T. puhescens errSeF CDIYeelqAE .daFAYnadL 

dKFYGtGyGQ 

15 A. pediades etpSPF CNLF . . TPEE FaQFEYFgdL 

dKFYGtGyGQ 

P. lycii gnaSPF CDLF. .TAEE YvsYEYYydL 

dKYYGtGPGN 

A. terreus 9al dD. .Aht LSPF CDLF. .TAtE WtQYNYLlSL 

20 dKYYGYGGGN 

A . terreus cbs dD. .Aht LSPF CDLF. .TAAE WtQYNYLlSL 

dKYYGYGGGN 

A. niger var. awamori Tv. .DTK LSPF CDLF. .ThDE WiHYDYLQSL 

kKYYGHGAGN 

25 A. niger T213 Tv. .DTK LSPF CDLF. .ThDE WiHYDYLRSL 

kKYYGHGAGN 

A, niger NRRL3135 Tv. .DTK LSPF CDLF.. ThDE WiNYDYLQSL 

kKYYGHGAGN 

A. fumigatus ATCC13073 SD. -ASQ LSPF CQLF . . ThNE WkKYNYLQSL 

30 gKYYGYGAGN 

A. fumigatus ATCC32722 SD. .ASQ LSPF CQLF .. ThNE WkKYNYLQSL 

gKYYGYGAGN 

A . fumigatus ATCC5812B SD. .ASQ LSPF CQLF.. ThNE WkKYNYLQSL 

gKYYGYGAGN 

35 A. fumigatus ATCC26906 SD. .ASQ LSPF CQLF. .ThNE WkKYNYLQSL 

gKYYGYGAGN 

A . fumigatus ATCC32239 AD..ASE LSPF CAIF. .ThNE WkKYDYLQSL 

gKYYGYGAGN 



E. nidulans 
SKYYGYGAGS 

T . th ermoph ilus 
gKYYGnGGGN 

5 T . 1 anuginosa 
dKYYSHGGGS 

M, thermophila 
gKWYGYGPGN 



10 Consensus Seq. 11 
KYYGYGAGN 



350 

15 P. involutus ( phyA 1 ) 
FPLNkTFYAD 

P. involutus (phyA2) 
FPLNkTMYAD 

T. puJbescens 
20 FPLNrTLYAD 

A. pediades 
FPLDrSIYAD 

P. lycii 
FPLNrTFYAD 

25 A . terreus 9al 
FPLNATLYAD 

A. terreus cbs 
FPLNATLYAD 

A. niger var. avamori 
30 FPLNS TLYAD 

A. niger T213 
FPLNSTLYAD 

A. niger NRRL3135 
FPLNSTLYAD 

35 A. fumigatus ATCC13073 
FPLNATMYvD 

A. fumigatus ATCC32722 
FPLNATMYvD 



2 2 JAM, 19S9 

-85- 

AH. .GTE LSPF CAIF..TEKE WlQYDYLQSL 

ht . . DT LSPF CALs . . TqEE WqaYDYYQSL 

PvlfPrQ LSPF CHLF..TADD WmaYDYYyTL 

SsdpATadag ggngrpLSPF CrLF..SEsE WraYDYLQSV 

SD--ATQ LSPF CDLF--TADE W-QYDYLQSL - 

301 

eLGPvQGVGY vNEL I ARLTN S.AVRDNTqT NRTLDASPvT 
ALGPvQGVGY iNELLARLTN S.AVNDNTqT NRTLDAaPDT 
PLGPvQGVGY iNELIARLTa q.nVsDHTqT NsTLDSSPET 
PLGPvQGVGY iNELLARLTE m.PVRDNTqT NRTLDS S PIT 
ALGPvQGVGY vNELLARLTg q.AVRDETqT NRTLDSDPAT 
PLGPvQGVGW aNELMARLTR A.PVHDHTCv NNTLD AS PAT 
PLGPvQGVGW aNEL I ARLTR S.PVHDHTCv NNTLD ANP AT 
PLGPTQGVGY aNELIARLTH S . PVHDDTSS NHTLDSNPAT 
PLGPTQGVGY aNELIARLTH S. PVHDDTSS NHTLDSNPAT 
PLGPTQGVGY aNELIARLTH S . PVHDDTSS NHTLDS S PAT 
PLGPAQGIGF t NEL I ARLTR S . PVQDHTST NsTLvSNPAT 
PLGPAQGIGF tNELIARLTR S . PVQDHTST NsTLvSNPAT 



Modtaget PD 
2 2 JAM. 1SS9 



A. fumigatus ATCC5 8128 
FPLNATMYvD 



-86- 

PLGPAQGIGF tNELIARLTR S . PVQDHTST NsTLvSNPAT 



A . fumigatus ATCC2 6906 
FPLNATMYvD 



PLGPAQGIGF tNELIARLTR S. PVQDHTST NsTLvSNPAT 



5 A. fumigatus ATCC32239 
FPLNAT I YvD 



PLGPAQGIGF tNELIARLTN S . PVQDHTST NsTLDSDPAT 



E. nidulans 
FPLDrkLYAD 



PLGPAQGIGF t NEL I ARLTQ S . PVQDNTST NHTLDSNPAT 



T. thermophilus 
10 FPLNATLYAD 



PLGPAQGVGF vNEL I ARMTH S . PVQDYTTV NHTLDSNPAT 



T. lanuginosa 
FPLDAvLYAD 



AFGPSRGVGF vNELIARMTg NlPVKDHTTv NHTLDdNPET 



AT. theirmophila 
FPLGrPLYAD 



PLGPTQGVGF vNELLARLA. GvPVRDgTST NRTLDGDPrT 



15 



Consensus Soq. 11 
FPLNATLYAD 



PLGPAQGVGF -NELIARLTH S- PVQDHTST NHTLDSNPAT 



20 400 



351 



P . invol utus (phyAl ) 
TSS1VPFSGR 



FSHDN 1MVAV FsAMGLFrqP aPLSTSvpNP wrt Wr 



P . invol utus ( phyA2 ) 
TSSWPFSAR 



FSHDN1MVAV FsAMGLFrqS aPLSTSTpDP nrt Wl 



25 r. pubescens 
vkkiVPFSAR 



FSHDNqMVAI FsAMGLFNqS a PLdPTTpDP art Fl 



A. pediades 
TSRl tPFSAR 



LSHDNqMIAI FsAMGLFNqS sPLdPSfpNP krt Wv 



P. lycii 
30 DSklVPFSGH 



FSHDNTMVPI FaALGLFNAT a.LdPlkpDe nrl Wv 



A. terreus 9a 1 
AAWTVPFAAR 



FSHDSnLVSI FWALGLYNGT aPLSqTSVES Vs . . QTDGYA 



A. terreus cbs 
AAWTVPFAAR 



FSHDSnLVSI FWALGLYNGT KPLSqTTVEd It . . rTDGYA 



35 A. niger var. awamori 
SAWTVPFASR 



FSHDNGIISI LFALGLYNGT KPLSTTTVEN It . . QTDGFS 



A . niger T213 
SAWTVPFASR 



FSHDNGIISI LFALGLYNGT KPLSTTTVEN It.. QTDGFS 



Mocitagei ru 
22 JAN. 1SS9 



87 



A. niger NRRL3135 
SAWTVPFASR 

A. fumigatus ATCC13073 
ASWvVPFGAR 

5 A. fumigatus ATCC32722 
ASWvVPFGAR 

A. fumigatus ATCC5 812B 
ASWvVPFGAR 

A. fumigatus ATCC2 6906 
10 ASWvVPFGAR 

A. fumigatus ATCC32239 
ASWAVPFGAR 

E. nidulans 
ASWTVPFGAR 

15 T . th ermophi lus 
AAWTVP FGGR 

T. lanuginosa 
ASWTVPFAAR 

M. thermophila 
20 ASWAVPFAAR 



FSHDNGIISI LFALGLYNGT KPLSTTTVEN It . . QTDGFS 



FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak . . ElDGYS 



FSHDNSMVSI FFALGLYNGT gPLSrTSVES ak. . ElDGYS 



FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak. . ElDGYS 



FSHDNSMVSI FFALGLYNGT EPLSrTSVES ak.. ElDGYS 



FSHDNGMIPI FFAMGLYNGT EPLSqTSeES tk. . ESNGYS 



FSHDNSMISI FFAMGLYNGT QPLSmdSVES Iq . . EmDGYA 



FSHDNTMtSI FaALGLYNGT akLSTTelKS Ie. . ETDGYS 



FSHDNTMtGI FsAMGLYNGT KPLSTSklQP ptgaAADGYA 



FSHDNdMMGV LgALGaYDGv pPLdkTArrd . . peElGGYA 



Consensus Seq* 11 
ASWTVPFAAR 



FSHDNTMVSI FFALGLYNGT KPLSTTSVES I ETDGYA 



25 



450 



P. involutus (phyAl) 
PLEfCGgDRn 

P. involutus {phyA2) 
30 PLEfCGgDQd 

T. puJbescens 
PLafCGaDts 

A. pediades 
PLkfCGgDmd 

35 P. lycii 

PLEfCGg.vd 

A. terreus 9al 
PLHGCPtDKL 



401 



mvVErLsC. fGt Tk VRVLVQDQVq 



maVErLsC . . AGt Tk VRVLVQDQVq 



mvVErLDC . . GGa Qs VRLLVND a Vq 



mvtErLlCQr DGtGsGGpsr imrNgnvQTF VRILVNDaLq 



mtVEkLaC sgKea VRVLVNDaVq 



AYVEMMQCrA EK. . . EPL VRVLVNDRVM 



Modtaget 
22 JAN. 1999 

-88- 

A. terreus cbs AYIEMMQCrA EK. . .QPL VRVLVNDRVM 

PLHGCAVDNL 



A . niger var. awamori 1YVEMMQCQA EQ. . . EPL VRVLVNDRW 

PLHGCPIDaL 

5 A. niger T213 1YVEMMQCQA EQ. . .EPL VRVLVNDRW 

PLHGCPIDaL 

A. niger NRRL3135 1YVEMMQCQA EQ...EPL VRVLVNDRW 

PLHGCPVDaL 

A. fumigatus ATCC13073 AYf EtMQCKS EK. . .EPL VRaLINDRW 

10 PLHGCDVDKL 

A. fumigatus ATCC32722 AYf EtMQCKS EK. . . EPL VRaLINDRW 

PLHGCDVDKL 

A. fumigatus ATCC5812B AYf EtMQCKS EK...ESL VRaLINDRW 

PLHGCDVDKL 

15 A. fumigatus ATCC26906 AYf EtMQCKS EK. . .EPL VRaLINDRW 

PLHGCDVDKL 

A. fumigatus ATCC32239 AYf EtMQCKS EK. EPL VRaLINDRW 

PLHGCAVDKL 

E. nidulans AYf ELMQCE KK ... EPL VRVLVNDRW 

20 PLHGCAVDKF 

T. thermophilus AYIEMMQCDD sD ... EPV VRVLVNDRW 

PLHGCEVDsL 



T. lanuginosa AYVELLRCET ETsSeEEeEG . . ED . . . EPF VRVLVNDRW 
PLHGCrVDRW 

25 M. thermophila iYVEkMRCsG GGgGgGGgEG . . rQekdEeM VRVLVNDRVM 
TLkGCGaDEr 



Consensus Seq. 11 
PLHGCGVDKL 



30 



P. involutus {phyAD 
P. involutus {phyA2) 
T. puhescens 
35 A. pediades 
P. lycii 



AYVEMMQCEA GG-G-GG-EG - - EK EPL VRVLVNDRW 

451 482 

GlCtLAKFVE SqTFARSDga GDFE KCFAt S a~ 

G 1 CaLDKFVE SqAYARSGga GDFEKCLAtt v- 

GvCtLDAFVE S q AYARNDg e GDFEKCFAt- — 

SlCtLEAFVE SqkYAReDgq GDFEKCFD — — 

GvCELsAFVE SqTYAReNgq GDFAKCgfvp se 



Modtaget PD 
7.2 JAN. 19S9 



A. terreus 9al 

A. terreus cbs 

A. niger var. awamori 

A. niger T213 

A. niger NRRL3135 

A, fumigatus ATCC13073 

A. fumigatus ATCC32722 

A. fumigatus ATCC5812B 

A. fumlgratus ATCC26906 

A. fumigatus ATCC32239 

E. nidulans 

T. tiiermophilus 

T. lanuginosa 

M. thermophila 



-89- 

GRCKrDAFVA GLSFAQAG. 
GRCKrDDFVE GLSFARAG. 
GRCtrDsFVr GLSFARSG. 
GRCtrDsFVr GLSFARSG. 
GRCtrDsFVr GLSFARSG. 
GRCKLNDFVK GLSWARSG. 
GRCKLNDFVK GLSWARSG. 
GRCKLNDFVK GLSWARSG. 
GRCKLNDFVK GLSWARSG. 
GRCKLKDFVK GLSWARSG. 
GRCtLDDWVE GLNFARSG. 
GRCKrDDFVr GLSFARqG. 
GRCRrDEWIK GLTFARqG. 
GmCtLErFIE SMAFARGN. 



GNWADCF — 

GNWAECF — 

GDWAECsA — — 
GDWAECFA — — 
GDWAECFA — — 
GNWGECFS — — 
GNWGECFS — — 
GNWGECFS — — 
GNWGECFS — — 
GNSEQSFS — — 
GNWktCFTl- — 
GNWEGCYAas e~ 

GHWDrCF ~~ 

GKWD1CFA — — 



Consensus Seq. 11 



GRCKLDDFVE GLSFARSG-- GNWAECFA-- -- 



Figure 7 



-90 



Modtaget PD 
22 JAN. 1999 



MGVFVVLLSIATLFGSTSGT 

20 

ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 

5 x --- + + + + + + 

60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 
ALGPRGNSHSCDTVDGGYQC 

10 40 

GCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 
61 + + + + + + 

12 0 

CGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

15 

FPEISHLWGXYSPYFSIiADE 

60 

TTCCCAGAAATTTCTCACTTGTGGGGTACCTACTCTCCATACTTCTCTTTGGCAGACGAA 
121 + + + + + + 

20 180 

AAGGGTCTTTAAAGAGTGAACACCCCATGGATGAGAGGTATGAAGAGAAACCGTCTGCTT 
SAISPDVPDDCRVTFVQVLS 

80 

25 TCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 
187 + + : + + + + 

240 

AGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAGTTCAAAACAGA 

30 RHGARYPTSSASKAYSALIE 
100 

AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCTTACTCTGCTTTGATTGAA 
241 + + + + + + 

300 

35 TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGAATGAGACGAAACTAACTT 



AIQKNATAFKGKYAFLKTYN 

120 



Modtaget PD 
22 JAIU999 

- 91 - 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 
301 + + + + + + 

360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

5 

YTLGADDLTPFGENQMVNSG 

140 

TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGGTTAACTCTGGT 
361 + + + + + + 

10 420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTTTGGTTTACCAATTGAGACCA 
IKFYRRYKALARKIVPFIRA 

160 

1 5 ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 
421 + + + + + + 

480 

TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 

20 S GS DRVIASAE KF I EGFQS A 

180 

TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 

54 0 

25 AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 
KLADPGSQPHQASPVIEVI I 

200 

AAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTAACGTGATCATT 

30 541 --- + + + + + + 

600 

TTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAATTGCACTAGTAA 
PEGSGYNNTLDHGTCTAFED 

35 220 

CCAGAAGGATCCGGTTACAACAACACTTTGGACCACGGTACTTGTACTGCTTTCGAAGAC 



W odtafl<* YU 



-92- 

601 + + + + + + 

660 

GGTCTTCCTAGGCCAATGTTGTTGTGAAACCTGGTGCCATGAACATGACGAAAGCTTCTG 

5 SELGDDVEANFTALFAPAIR 
240 

TCTGAATTAGGTGACGACGTTGAAGCTAACTTCACTGCTTTGTTCGCTCCAGCTATTAGA 
661 + + + + + + 

720 

10 AGACTTAATCCACTGCTGCAACTTCGATTGAAGTGACGAAACAAGCGAGGTCGATAATCT 



ARLEADLPGVTLTDEDVVYL 

260 

GCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACGAAGACGTTGTTTACTTG 

15 721 + + + + + + 

780 

CGATCTAACCTTCGACTGAACGGTCCACAATGAAACTGACTGCTTCTGCAACAAATGAAC 



MDMCPFETVARTSDATELSP 

20 280 

ATGGACATGTGTCCATTCGACACTGTCGCTAGAACTTCTGACGCTACTGAATTGTCTCCA 
781 + + + + + + 

840 

TACCTGTACACAGGTAAGCTGTGACAGCGATCTTGAAGACTGCGATGACTTAACAGAGGT 

25 

FCALFTHDEWiQYDYLQSLG 

300 

TTCTGTGCTTTGTTCACTCACGACGAATGGATCCAATACGACTACTTGCAAAGCTTGGGT 
841 --- + + + + + + 

30 900 

AAGACACGAAACAAGTGAGTGCTGCTTACCTAGGTTATGCTGATGAACGTTTCGAACCCA 
KYYGYGAGNPLGPAQGVGFA 

320 

35 AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGCT 
901 + + + + + + 

960 



2 2 »St ^ 



-93- 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCGA 
NELIARLTHSPVQDHTSTNH 

340 

5 AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 

1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

10 TLDSNPATFPLNATLYADFS 
360 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 
1021 + + + + + + 

1080 

15 TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 



HDNXMISIFFALGLYNGTJSP 

380 

CACGACAACACTATGATATCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACCAAGCCA 

20 1081 + + + + + + 

1140 

GTGCTGTTGTGATACTATAGATAAAAGAAGCGAAACCCAAACATGTTGCCATGGTTCGGT 



LSTTSVESIEETDGYSASWT 

25 400 

TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 
1141 + + + + + + 

1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

30 

VPF&ARAYVEMMQCQAEKEP 

420 

GTTC CATTCGCTGCT AGAGCTT ACGTTGAAATGATG C AATGTCAAGCTG AAAAGG AACC A 
12 01 + + + + + + 

35 1260 

CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACAGTTCGACTTTTCCTTGGT 



94- 



LVRVLVNDRVVPLHGCAVDK 

440 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGCTGTTGACAAG 

5 1261 — +- — + +----:----+ + + 

1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACGACAACTGTTC 
LGRCKRDDFVEGLSFARSGG 

10 460 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 
1321 + + + + + + 

1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAT^AGCGATCTAGACCACCA 

15 

NWAECFA* 467 
AACTGGGCTGAATGTTTCGCTTAA 

1381 + + + 1410 

TTGACCCGACTTACAAAGCGAATT 

20 



25 



30 



Modtago^- 
22 JAN. 1999 

-95- 



Figure 8 



MGVFVVLLS IATLFGSTSGT 

20 

ATGGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCACATCCGGTACC 
! + + + + + + 

60 

TACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGTGTAGGCCATGG 
ALGPRGNSHSCDTVDGGYQC 

40 

GCCTTGGGTCCTCGTGGTAACTCTCACTCTTGTGACACTGTTGACGGTGGTTACCAATGT 
61 + + + + + + 

120 

CGGAACCCAGGAGCACCATTGAGAGTGAGAACACTGTGACAACTGCCACCAATGGTTACA 

A FPEISHLWG2YSPFFSLADE 
60 

TTCCCAGAAATTTCTCACTTGTGGGGTACATACTCTCCATTCTTCTCTTTGGCTGACGAA 
121 + + + + + + 

180 



AAGGGTCTTTAAAGAGTGAACACCCCATGTATGAGAGGTAAGAAGAGAAACCGACTGCTT 



Modta9 elPD 
2 2 JAN. «S9 

-96- 

SAISPDVPKGCRVTFVQVLS 

80 

TCTGCTATTTCTCCAGACGTTCCAAAGGGTTGTAGAGTTACTTTCGTTCAAGTTTTGTCT 

5 181 + + + + + + 

240 

AGACGATAAAGAGGTCTGCAAGGTTTCCCAACATCTCAATGAAAGCAAGTTCAAAACAGA 
RHGARYPTSS&SKAYSALIE 

10 100 

AGACACGGTGCTAGATACCCAACTTCTTCTGCGTCTAAGGCGTACTCTGCTTTGATTGAA 
241 + + + + + + 

300 

TCTGTGCCACGATCTATGGGTTGAAGAAGACGCAGATTCCGCATGAGACGAAACTAACTT 

15 

AIQKNATAFKGKYAFLKTYN 

120 

GCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGAAGACTTACAAC 
301 + + + + + + 

20 360 

CGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACTTCTGAATGTTG 

A YTLGADDLTPFGEQQMVNSG 
140 

25 TACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAACAACAAATGGTTAACTCTGGT 
361 + + + + + + 

420 

ATGTGAAACCCACGACTGCTGAACTGAGGTAAGCCACTTGTTGTTTACCAATTGAGACCA 

30 IKFYRRYKALARKIVPFiRA 
160 

ATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCATTCATTAGAGCT 
421 + + + + + + 

480 

35 TAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTAAGTAATCTCGA 
SGSDRVI ASAEKFIEGFQSA 

180 



22 JAN. 1S99 



-97- 

TCTGGTTCTGACAGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTTTCCAATCTGCT 
481 + + + + + + 

540 

AGACCAAGACTGTCTCAATAACGAAGACGACTTTTCAAGTAACTTCCAAAGGTTAGACGA 

5 

KLADPGANPHQASPVINVII 

200 

AAGTTGGCTGACCCAGGTGCTAACCCACACCAAGCTTCTCCAGTTATTAACGTTATTATT 
541 + + + + + + 

10 600 

TTCAACCGACTGGGTCCACGATTGGGTGTGGTTCGAAGAGGTCAATAATTGCAATAATAA 
PEGAGYNNTLDHGLCTAFEE 

220 

15 CCAGAAGGTGCTGGTTACAACAACACTTTGGACCACGGTTTGTGTACTGCTTTCGAAGAA 
601 + + + + + + 

660 

GGTCTTCCACGACCAATGTTGTTGTGAAACCTGGTGCCAAACACATGACGAAAGCTTCTT 

20 SELGDDVEANFTAVFAPPIR 
240 

TCTGAATTGGGTGACGACGTTGAAGCTAACTTCACTGCTGTTTTCGCTCCACCAATTAGA 
561 + + + + + + 

720 

25 AGACTTAACCCACTGCTGCAACTTCGATTGAAGTGACGACAAAAGCGAGGTGGTTAATCT 



ARLEAHLPGVNIiTDEDVVNIj 

260 

GCTAGATTGGAAGCTCACTTGCCAGGTGTTAACTTGACTGACGAAGACGTTGTTAACTTG 

30 721 + + + + + + 

780 

CGATCTAACCTTCGAGTGAACGGTCCACAATTGAACTGACTGCTTCTGCAACAATTGAAC 



MDMCPFDTVARTSDATQLS P 

35 280 

ATGGACATGTGTCCATTCGACACTGTTGCTAGAACTTCTGACGCTACTCAATTGTCTCCA 



22 JAN. &S3 



-98- 

781 + + + + + + 

840 

TACCTGTACACAGGTAAGCTGTGACAACGATCTTGAAGACTGCGATGAGTTAACAGAGGT 

5 FCDLFTHDEWIQYDYIiQSLG 
300 

TTCTGTGACTTGTTCACTCACGACGAATGGATTCAATACGACTACTTGCAATCTTTGGGT 
841 + + + + + + 

900 

10 AAGACACTGAACAAGTGAGTGCTGCTTACCTAAGTTATGCTGATGAACGTTAGAAACCCA 



KYYGYGAGNPLGPAQGVGFV 

320 

AAGTACTACGGTTACGGTGCTGGTAACCCATTGGGTCCAGCTCAAGGTGTTGGTTTCGTT 

15 901 + + + + + + 

960 

TTCATGATGCCAATGCCACGACCATTGGGTAACCCAGGTCGAGTTCCACAACCAAAGCAA 



NELIARLTHSPVQDHTSTNH 

20 340 

AACGAATTGATTGCTAGATTGACTCACTCTCCAGTTCAAGACCACACTTCTACTAACCAC 
961 + + + + + + 

1020 

TTGCTTAACTAACGATCTAACTGAGTGAGAGGTCAAGTTCTGGTGTGAAGATGATTGGTG 

25 

TLDSNPATFPIiNATLYADFS 

360 

ACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACGCTGACTTCTCT 
1021 + + + + + + 

30 1080 

TGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGCGACTGAAGAGA 
HDNTMVS I FFALGLYNGTKP 

380 

35 CACGACAACACTATGGTTTCTATTTTCTTCGCTTTGGGTTTGTACAACGGTACTAAGCCA 
1081 + + + + + + 

1140 
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GTGCTGTTGTGATACCAAAGATAAAAGAAGCGAAACCCA^CATGTTGCCATGATTCGGT 
LSTTSVESIEETDGYfiASWT 

400 

5 TTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTGCTTCTTGGACT 
1141 + + + + + + 

1200 

AACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGACGAAGAACCTGA 

10 vPFAARAYVEMMQCEAEKEP 
420 

GTTCCATTCGCTGCTAGAGCTTACGTTGAAATGATGCAATGTGAAGCTGAAAAGGAACCA 
1201 + + + + + + 

1260 

15 CAAGGTAAGCGACGATCTCGAATGCAACTTTACTACGTTACACTTCGACTTTTCCTTGGT 
LVRVLVNDRVVPLHGCGVDK 

440 

TTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTGGTGTTGACAAG 

20 1261 + + * + + + 

1320 

AACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACACCACAACTGTTC 
LGRCKRDDFVEGLSFARSGG 

25 460 

TTGGGTAGATGTAAGAGAGACGACTTCGTTGAAGGTTTGTCTTTCGCTAGATCTGGTGGT 
1321 + + + + + + 

1380 

AACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGATCTAGACCACCA 

30 

NWEECFA* 467 
AACTGGGAAGAATGTTTCGCTTAA 

1381 + + 1404 

TTGACCCTTCTTACAAAGCGAATT 
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5 



10 



15 



Figure 9 

20 



MGVFVVLLSIATLFGSTSGT 20 
ATGGGGGTTTTCGTCGTTCTATTATCTATCGCGACTCTGTTCGGCAGCACATCGGGCACT 

x + + + + + + 60 

TACCCCCAAAAGCAGCAAGATAATAGATAGCGCTGAGACAAGCCGTCGTGTAGCCCGTGA 



Modtaget y\j 
22 JAN. 1SS3 



101 



ALGPRGNHSKSCDTVDLGYQ 40 
GCGCTGGGCCCCCGTGGAAATCACTCCAAGTCCTGCGATACGGTAGACCTAGGGTACCAG 

61 + + + + + + 120 

CGCGACCCGGGGGCACCTTTAGTGAGGTTCAGGACGCTATGCCATCTGGATCCCATGGTC 

CSPATSHLWGIYSPXFSLED 60 
TGCTCCCCTGCGACTTCTCATCTATGGGGCACGTACTCGCCATaCTTTTCGCTCGAGGAC 

12 1 + + + + + + 180 

ACGAGGGGACGCTGAAGAGTAGATACCCCG t g CATGAGCGGTA t GAAAAGCGAGCTCCTG 

EliSVSSKIiPKDCRITLVQVL 80 
GAGCTGTCCGTGTCGAGTAAGCTTCCCAAGGATTGCCGGATCACCTTGGTACAGGTGCTA 

181 + + + + + + 240 

CTCGACAGGCACAGCTCATTCGAAGGGTTCCTAACGGCCTAGTGGAACCATGTCCACGAT 

SRHGARYPTSSKSKKYKKLX 100 
TCGCGCCATGGAGCGCGGTACCCAACCAGCTCCAAGAGCAAAAAGTATAAGAAGCTTaTt 

241 + + + + + + 300 

AGCGCGGTACCTCGCGCCATGGGTTGGTCGAGGTTCTCGTTTTTCATATTCTTCGAAtAa 

TAIQANATDFKGKXAFLKTY 120 
ACGGCGATCCAGGCCAATGCCACCGACTTCAAGGGCAAGTacGCCTTTTTGAAGACGTAC 

301 + + + + + + 360 

TGCCGCTAGGTCCGGTTACGGTGGCTGAAGTTCCCGTTCAtgCGGAAAAACTTCTGCATG 

NYTLGADDLTPFGEQQLVNS 140 
AACTATACTCTGGGTGCGGATGACCTCACTCCCTTTGGGGAGCAGCAGCTGGTGAACTCG 

361 + + -i- + + + 420 

TTGATATGAGACCCACGCCTACTGGAGTGAGGGAAACCCCTCGTCGTCGACCACTTGAGC 

GIKFYQRYKALARSVVPFIR 160 
GGCATCAAGTTCTACCAGAGGTACAAGGCTCTGGCGCGCAGTGTGGTGCCGTTTATTCGC 



Modtaget PD 
22 JAM. «S9 

- 102- 

421 + + + + + + 480 

CCGTAGTTCAAGATGGTCTCCATGTTCCGAGACCGCGCGTCACACCACGGCAAATAAGCG 

ASGSDRVIASGEKFIEGFQQ 180 
GCCTCAGGCTCGGACCGGGTTATTGCTTCGGGAGAGAAGTTCATCGAGGGGTTCCAGCAG 

481 + + + + + + 540 

CGGAGTCCGAGCCTGGCCCAATAACGAAGCCCTCTCTTCAAGTAGCTCCCCAAGGTCGTC 

AKLADPGATNRAAPAISVI I 200 
GCGAAGCTGGCTGATCCTGGCGCGACGAACCGCGCCGCTCCGGCGATTAGTGTGATTATT 

541 + + + + + + 600 

CGCTTCGACCGACTAGGACCGCGCTGCTTGGCGCGGCGAGGCCGCTAATCACACTAATAA 

PESETFNNTLDHGVCTKFEA 220 
CCGGAGAGCGAGACGTTCAACAATACGCTGGACCACGGTGTGTGCACGAAGTTTGAGGCG 

60 i + + + + + + 660 

GGCCTCTCGCTCTGCAAGTTGTTATGCGACCTGGTGCCACACACGTGCTTCAAACTCCGC 

SQLGDEVAANFTAIiFAPDIR 240 
AGTCAGCTGGGAGATGAGGTTGCGGCCAATTTCACTGCGCTCTTTGCACCCGACATCCGA 

66i + + + + + + 720 

TCAGTCGACCCTCTACTCCAACGCCGGTTAAAGTGACGCGAGAAACGTGGGCTGTAGGCT 

ARIiEKHLPGVTIiTDEDVVSL 260 
GCTCGCc t CG AGAAGC ATCTTCCTGGCGTGACGCTGACAGACGAGGACGTTGTC AGTCTA 

721 + + + + + + 780 

CGAGCGgaGCTCTTCGTAGAAGGACCGCACTGCGACTGTCTGCTCCTGCAACAGTCAGAT 

MDMCEFDTVARTSDASQLSP 280 
ATGGACATGTGTcCGTTTGATACGGTAGCGCGCACCAGCGACGCAAGTCAGCTGTCACCG 
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TACCTGTACACAgGCAAACTATGCCATCGCGCGTGGTCGCTGCGTTCAGTCGACAGTGGC 

FCQLFTHNEWKKYDYLQSIiG 31 
TTCTGTCAACTCTTCACTCACAATGAGTGGAAGAAGTACgACTACCTTCAGTCCTTGGGC 

841 + + + + + + 

AAGACAGTTGAGAAGTGAGTGTTACTCACCTTCTTCATGCTGATGGAAGTCAGGAACCCG 

KYYGYGAGNPIiGPAQGIGFT 3i 
AAGTACTACGGCTACGGCGCAGGCAACCCTCTGGGACCGGCTCAGGGGATAGGGTTCACC 

90 i + + + - — + + + 9 

TTCATGATGCCGATGCCGCGTCCGTTGGGAGACCCTGGCCGAGTCCCCTATCCCAAGTGG 

NELIARLTRSPVQDHTSTNS 

340 

AACGAGCTGATTGCCCGGTTGACgCGTTCGCCAGTGCAGGACCACACCAGCACTAACTCG 
961 + + + + + + 

1020 

TTGCTCGACTAACGGGCCAACTGcGCAAGCGGTCACGTCCTGGTGTGGTCGTGATTGAGC 
TLVSNPATFPLNATMYVDFS 

360 

ACTCTAGTCTCCAACCCGGCCACCTTCCCGTTGAACGCTACCATGTACGTCGACTTTTCA 
1021 + + + + + + 

1080 

TGAGATCAGAGGTTGGGCCGGTGGAAGGGCAACTTGCGATGGTACATGCAGCTGAAAAGT 
HDNSMVS IFFAIjGLYNGTEP 

380 

CACGACAACAGCATGGTTTCCATCTTCTTTGCATTGGGCCTGTACAACGGCACTGAACCC 
1081 + + + + + + 

1140 

GTGCTGTTGTCGTACCAAAGGTAGAAGAAACGTAACCCGGACATGTTGCCGTGACTTGGG 
LSRTSVESAKEIiDGYSASWV 
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TTGTCCCGGACCTCGGTGGAAAGCGCCAAGGAATTGGATGGGTATTCTGCATCCTGGGTG 
1141 + + + + + + 

1200 

AACAGGGCCTGGAGCCACCTTTCGCGGTTCCTTAACCTACCCATAAGACGTAGGACCCAC 

5 

VPFGARAYFETMQCKSEKEP 

420 

GTGCCTTTCGGCGCGCGAGCCTACTTCGAGACGATGCAATGCAAGTCGGAAAAGGAGCCT 
1201 + + + + + + 

10 1260 

CACGGAAAGCCGCGCGCTCGGATGAAGCTCTGCTACGTTACGTTCAGCCTTTTCCTCGGA 
LVRALINDRVVPLHGCDVDK 

440 

15 CTTGTTCGCGCTTTGATTAATGACCGGGTTGTGCCACTGCATGGCTGCGATGTGGACAAG 



1261 



- + 



1320 

GAACAAGCGCGAAACTAATTACTGGCCCAACACGGTGACGTACCGACGCTACACCTGTTC 

20 LGRCKLNDFVKGLSWARSGG 
460 

CTGGGGCGATGCAAGCTGAATGACTTTGTCAAGGGATTGAGTTGGGCCAGATCTGGGGGC 
1321 + + + + + + 

1380 

25 GACCCCGCTACGTTCGACTTACTGAAACAGTTCCCTAACTCAACCCGGTCTAGACCCCCG 

NWGECFS* 467 
AACTGGGGAGAGTGCTTTAGTTGA 

1381 + + 1404 

30 TTGACCCCTCTCACGAAATCAACT 
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Figure 10 

CP-l 

ECO RIMGVFVVLLSIATLFGST 
TATATGAArrCAIfiGGCGTGTTCGTCGTGCTACTGTCCATTGCCACCTTGTTCGGTTCCA 

+ + + + + + eo 

ATATACTTAAGTACCCGCACAAGCAGCACGATGACAGGTAACGGTGGAACAAGCCAAGGT 

SGTALGPRGNSHSCDTVDGG 
CATCCGGTACCGCCTTGGGTCCTCGTGGTAATTCTCACTCTTGTGACACTGTTGACGGTG 

61 + + + + + + 120 

GTAGGCCATGGCGGAACCCAGGAGCACCATTAAGAGTGAGAACACTGTGACAACTGCCAC 

CP-2 

CP-3 

YQCFPEISHLWGQYS PYFSIi 
GTTACCAATGTTTCCCAGAAATTTCTCACTTGTGGGGTCAATACTCTCCATACTTCTCTT 

121 + + + ♦ + + 180 

CAATGGTTACAAAGGGTCTTTAAAGAGTGAACACCCCAGTTATGAGAGGTATGAAGAGAA 

EDESAISPDVPDDCRVTFVQ 
TGGAAGACGAATCTGCTATTTCTCCAGACGTTCCAGACGACTGTAGAGTTACTTTCGTTC 

181 + + + + + + 240 

ACCTTCTGCTTAGACGATAAAGAGGTCTGCAAGGTCTGCTGACATCTCAATGAAAGCAAG 

CP-4,7 

CP-5.7 

VLSRHGARYPTDSKfiKfiYSA 
AAGTTTTGTCTAGACACGGTGCTAGATACCCAACTgacTCTAAGggtAAGaagTACTCTG 

241 + + + - — + — 300 

TTCAAAACAG ATCTGTGCCACGATCTATGGGTTGAc tgAGATTCc C aTTC 1 1 cATGAGAC 
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LIEAIQKNATAFKGKYAFLK 
CTTTGATTGAAGCTATTCAAAAGAACGCTACTGCTTTCAAGGGTAAGTACGCTTTCTTGA 

301 + + + + + + 360 

5 GAAACTAACTTCGATAAGTTTTCTTGCGATGACGAAAGTTCCCATTCATGCGAAAGAACT 

CP-6 

CP-7 

TYNYTLGADDLTPFGENQMV 
AGACTTACAACTACACTTTGGGTGCTGACGACTTGACTCCATTCGGTGAAAACCAAATGG 

10 361 + + + + + + 420 

TCTGAATGTTGATGTGAAACCCACGACfTGCTGAACTGAGGTAAGCCACTTTTGGTTTACC 

■NSGIKFYRRYKALARKIVPF 
TTAACTCTGGTATTAAGTTCTACAGAAGATACAAGGCTTTGGCTAGAAAGATTGTTCCAT 

15 421 + + + + + + 480 



AATTGAGACCATAATTCAAGATGTCTTCTATGTTCCGAAACCGATCTTTCTAACAAGGTA 

PP-a.7 

CP- 9 

IRASGS£RVIASAEKFIEGF 



20 TCATTAGAGCTTCTGGTTCT t C t AGAGTTATTGCTTCTGCTGAAAAGTTCATTGAAGGTT 

481 + + + + + + 540 

AGTAATCTCGAAGACCAAGAagaTCTCAATAACGAAGACGACTXTTCAAGTAACTTCCAA 

QSAKLADPGSQPHQASPVID 
25 TCCAATCTGCTAAGTTGGCTGACCCAGGTTCTCAACCACACCAAGCTTCTCCAGTTATTG 

541 + + + + + + 600 

AGGTTAGACGATTCAACCGACTGGGTCCAAGAGTTGGTGTGGTTCGAAGAGGTCAATAAC 

CP-10.7 

CF-I1.7 

30 VII£E&S£YNNTLDEGTCTA 
ACGTTATTATTtctGAcgctTCTtctTACAACAACACTTTGGACccaGGTACTTGTACTG 
gOl + + + + + + 660 



MocStage*<-^ 
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TGCAATAATAAagaCTgcgaAGGagaATGTTGTTGTQAAACCTGggtCCATGAACATGAC 



Modtage* 
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FEDSELADIVEANFTALFAP 
CTTTCGAAGACTCTGAATTGgc tGACac tGTTGAAGCTAACTTCACTGCTTTGTTCGCTC 

661 + + + + + + 720 

GAAAGCTTCTGAGACTTAACcgaCTG t g a CAACTTCGATTGAAGTGACGAAACAAGCGAG 
5 CP-12,7 
AIRARLEADLPGVTLTDIfiV 
CAGCTATTAGAGCTAGATTGGAAGCTGACTTGCCAGGTGTTACTTTGACTGACac t ga aG 

721 + + + + + + 780 

1 0 GTCGATAATCTCGATCTAACCTTCGACTG AACGGTCCACAATGAAACTGACTG tg ac 1 1 C 

CP-13.7 

XYLMDMCfiFETVARTSDATE 
TTac tTACTTGATGGACATGTGT tc tTTCGAAACTGTTGCTAGAACTTCTGACGCTACTG 

15 781 + + + + + + 840 

AAtgaATGAACTACCTGTACACAagaAAGCTTTGACAACGATCTTGAAGACTGCGATGAC 

LSPFCALFTHDEWRHYDYLQ 

AATTGTCTCCATTCTGTGCTTTGTTCACTCACGACGAATGGAGAcacTACGACTACTTGC 

20 841 + + + + + + 900 

TTAACAGAGGTAAGACACGAAACAAGTGAGTGCTGCTTACCTCTgtgATGCTGATGAACG 

CF-14.7 

CP-15.7 

SLJSKYYGHGAGNP LGPTQGV 
25 AATCTTTGa agAAGTACTACGGTcacGGTGCTGGTAACCCATTGGGTCCAac tCAAGGTG 

901 + + + + + + 960 

TTAGAAACttcTTCATGATGCCAgtgCCACGACCATTGGGTAACCCAGGTtgaGTTCCAC 

GFANELIARLTRSPVQDHTS 
30 TTGGTTTCGCTAACGAATTGATTGCTAGATTGACTAGATCTCCAGTTCAAGACCACACTT 

961 + + + + + + 

1020 

AACCAAAGCGATTGCTTAACTAACGATCTAACTGATCTAGAGGTCAAGTTCTGGTGTGAA 



109 



CP- 16 

CP-17.7 

TNHTLDSNPATFPLNATLYA 
CTACTAACCACACTTTGGACTCTAACCCAGCTACTTTCCCATTGAACGCTACTTTGTACG 



5 1021 + + + + + + 

1080 

GATGATTGGTGTGAAACCTGAGATTGGGTCGATGAAAGGGTAACTTGCGATGAAACATGC 

DFSHDNfillSIFFALGLYNG 
10 CTGACTTCTCTCACGACAACgg t a 1 1 ATTTCTATTTTCTTCGCTTTGGGTTTGTACAACG 

1081 + + + + + + 

1140 

GACTGAAGAGAGTGCTGTTGc ca t a aTAAAGATAAAAG AAGCGAAACCCAAACATGTTGC 

CP'18.7 

15 CP-19.7 

TAPLSTTSVESIEETDGYSS 
GTACTGCTCCATTGTCTACTACTTCTGTTGAATCTATTGAAGAAACTGACGGTTACTCTt 

1141 + + + + + + 

1200 

20 CATGACGAGGTAACAGATGATGAAGACAACTTAGATAACTTCTTTGACTGCCAATGAGAa 

£WTVPF&£RAYVEMMQCQAE 

C fcgc t TGG ACTGTTCCATTCgc 1 1 C tAG AGCTTACGTTGAAATGATGC AATGTCAAGCTG 

1201 + + + + + + 

25 1260 

g a cgaACCTGACAAGGTAAG cgaagaTCTCGAATGCAACTTT ACTACGTT ACAGTTCGAC 

CP-20 

CP-21 

KEPLVRVLVNDRVVPLHGCA 
30 AAAAGGAACCATTGGTTAGAGTTTTGGTTAACGACAGAGTTGTTCCATTGCACGGTTGTG 
1261 + + + + + + 

1320 

TTTTCCTTGGTAACCAATCTCAAAACCAATTGCTGTCTCAACAAGGTAACGTGCCAACAC 
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VDKLGRCKRDDFVEGLSFAR 
CTGTTGACAAGTTGGGTAGATGTAAGAGAGACGACTTCOTTGAAGGTTTGTCTTTCGCTA 

1321 + + + + + + 

1380 

5 GACAACTGTTCAACCCATCTACATTCTCTCTGCTGAAGCAACTTCCAAACAGAAAGCGAT 

CP-22 

SGGNWAECFA* Eco RI 
GATCTGGTGGTAACTGGGCTGAATGTTTCGCTTAAGAATTCATATA 

1381 + + + + 1426 

10 CTAGACCACCATTGACCCGACTTACAAAGCGAATTCTTAAGTATAT 
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Figure 11 

85.41 °C 



Consensus phytase-10 




10 20 30 40 50 60 70 80 90 100 °C 

-j 1 — , — i 1 — , — i 1 — , — i 1 — , — i 1 — , 




min 



] — i — i — i — i — | — i — » — i — i — | — i — i — « — i — | — r— i — i i | i — r— i — i — | — i — i — i — i — | — i — i — i — i — | — i — i — i — v 

10 20 30 40 50 60 70 80 °C 



ol IN*- 



- 112 - 



Figure 12 
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Consensus phytase-10 
thermo-Q50T-K91A 
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Figure 13 
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Figure 16 
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Figure 17 
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Modtaget PD 
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. A fumigatus 
a-mutant 
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Figure 20 
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Figure 21 

1 MGVFWLLSI ATLFGSTSGT ALGPRGNSHS CDTVDGGYQC FPEISSNWSP 
51 YSPXFSLADE SAISPDVPKG CRVTFVQVLQ RHGAREPTSfi AAIEISALIE 
101 AIQKNATAFK GKYAFLKTYN YTLGADDLVP FG ANQSSQA G IKFYRRYKAL 
151 ARKIVPFIRA SGSDRVIUSA TNWIEGFQSA KLADPGANPH QASPVINVII 
201 PEGAGYNNTL DHGLCTAFEE SELGDDVEAN FTAVFAPPIR ARLEAHLPGV 
251 NLTDEDWNL MDMCPFDTVA RTSDATELSP FCDLFTHDEW IQYDYLGDLD 
301 KYYGTGAGNP LGPAQGVGFV NELIARLTHS PVQDHTSTNH TLDSNPATFP 
351 LNATLYADFS HDNTMV&IFF ALGLYNGTKP LSTTSVESIE ETDGYSASWL 
401 VPFSARMYVE MMQCEAEKEP LVRVLVNDRV VPLHGCGVDK LGRCKRDDFV 
451 EGLSFARSGG NWEECFA 
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Abstract 



This invention relates to a new improved consensus phytase by introduction of 
5 additional phytase sequences into the sequence alignment and the method of 
the introduction process. Furthermore, the invention relates to the transfer of 
stabilizing amino acid exchanges found by the new method into homologous 
proteins. Furthermore, the invention relates to the replacement of a whole 
active site of a phytase. It also relates to the corresponding DNA sequences 
10 and its generation, methods to produce such phytases and the use thereof. 
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